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Foreword 


Diophantine problems represent some of the strongest aesthetic 
attractions to algebraic geometry. They consist in giving criteria 
for the existence of solutions of algebraic equations in rings and 
fields, and eventually for the number of such solutions. 

The fundamental ring of interest is the ring of ordinary integers 
Z, and the fundamental field of interest is the field Q of rational 
numbers. One discovers rapidly that to have all the technical 
freedom needed in handling general problems, one must consider 
rings and fields of finite type over the integers and rationals. 
Furthermore, one is led to consider also finite fields, p-adic fields 
(including the real and complex numbers) as representing a localiza- 
tion of the problems under consideration. 

We shall deal with global problems, all of which will be of a 
qualitative nature. On the one hand we have curves defined over 
say the rational numbers. If the curve is affine one may ask for 
its points in Z, and thanks to Siegel, one can classify all curves 
which have infinitely many integral points. This problem is 
treated in Chapter VII. One may ask also for those which have 
infinitely many rational points, and for this, there is only Mordell’s 
conjecture that if the genus is = 2, then there is only a finite 
number of rational points. 

On the other hand, we have abelian varieties. If A is an abelian 
variety defined over Q, then its group of rational points is finitely 
generated (Mordell-Weil theorem). The proofs do not give a con- 
structive method to get hold of the generators. Abelian varieties 
include of course curves of genus 1, which have been the funda- 
mental testing ground for conjectures, theorems, and proofs in the 
theory. 

As a curve of genus = 1 is characterized by the fact that it is 
embeddable in an abelian variety, one is led to study subvarieties 
of abelian varieties, and especially Mordell’s conjecture, in this 
light. A curve of genus = 2 is characterized by the fact that it 
is unequal to any non-zero translation of itself in its Jacobian. 
This is a very difficult hypothesis to use. 

It is also interesting to look at subvarieties of other group 
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varieties. For us, the natural ones to consider are the toruses, 
i.e. products of multiplicative groups. It will be shown that if G 
is such a group, V a curve contained in G, J’ a finitely generated 
subgroup of G, and V nT is infinite then V is the translation of 
a subtorus (in characteristic 0). 

For various generalizations, and comments on the above results 
and problems, see Chapters VI and VII and the Historical Notes 
at the end of these chapters. 

The prototypes of the diophantine results and methods given in 
this book were developed by Mordell, Weil, and Siegel between 
1920 and 1930. Considering recent developments in the techniques 
of algebraic geometry, and Roth’s theorem, it seemed quite worth 
while to give a systematic exposition of the theory as it stands, 
i.e. essentially a qualitative theory. One of the directions of 
research at present lies in making theorems quantitative, that is, 
getting estimates for the number of rational or integral points, or 
the generators of the Mordell-Weil group. It is a striking feature 
of the proofs that they are highly non-constructive. There do 
exist still some unsolved qualitative problems. Aside from 
Mordell’s conjecture, for instance, I would conjecture that Siegel’s 
result on integral points remains true for an affine open subset of 
an abelian variety, defined over Z (or a ring of finite type over Z). 
In general, the extension of Roth’s theorem and the theory of 
integral points as given in Chapter VII to varieties of dimension 
>1 is still lacking. 


New York, 1961 S. Lane 


Prerequisites 


They are of different type, some of the chapters being very 
elementary, and others requiring more background. 

Chapters I, If and Chapter III, §1, §2 are elementary and self 
contained, assuming only a knowledge of elementary algebra, say 
at the level of Galois theory, together with a few facts on valua- 
tion rings which may be found in Chapter I of [16]. Chapter III, 
§3 requires some knowledge of algebraic geometry, of an essentially 
standard nature. 

Chapter IV depends only on elementary facts concerning absolute 
values, together with the notion of linear systems and their basic 
properties as exposed in [16], Chapter VI, except for §3 which 
demands a more extensive knowledge, including some facts on 
abelian varieties. However, in our applications, we need the result 
of Chapter IV, §3 only for curves (in which case it can be proved 
by using the Riemann-Roch theorem) and for abelian varieties in 
which case it is easily reduced to the criterion of §2. 

Chapter V uses the theory of reduction modp and the theory of 
abelian varieties. The reader will find all necessary background 
in [36] or [11]. 

Chapter VI is again elementary, depending only on simple 
properties of absolute values. 

Chapter VII is axiomatized to the greatest possible extent, but 
its results combine everything that precedes it. 

Chapter VIII is again almost entirely elementary and in parti- 
cular, the proof of Hilbert’s theorem for the rationals can be read 
with only a knowledge of beginning calculus. 

The reader will therefore see that Chapters I and II give a good 
part of classical algebraic number theory. Chapters III (except for 
§3) and VI may be viewed as providing a continuation of this 
algebraic number theory in a more specialized direction, distinct 
from the multiplicative direction which may be taken by going into 
class field theory at that point. 

For an introduction to algebraic geometry, I refer the reader to 
[16], which will be quoted as IAG. For abelian varieties, all 
results needed are contained in [17], which will be referred to as 
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AV, with the exception of properties of reduction mod p (cf. [36] 
or [11]). References to Weil’s Foundations will be to the first or 
second edition, in canonical style F’ — X, Th. Z, or F* — X, Th. Z. 

A list of symbols used most frequently will be found on p. 167. 
Unless otherwise specified, all rings are without zero divisors, and 
with unit elements. Z is the integers, and Q the rational numbers. 
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CHAPTER I 
Absolute Values 


What distinguishes an arithmetic problem from a geometric one 
is the nature of the ground ring or ground field and any prime 
structure it may have. In addition to the existence of primes (prime 
numbers, prime ideals, etc.) one must also, or even principally, take 
into account archimedean absolute values. It turns out that all the 
foundational material we need can be stated in terms of absolute 
values, and we have included in this chapter everything we need 
in the sequel. 

It is convenient to localize certain questions by taking the com- 
pletion of our field with respect to an absolute value. In fact, our 
insight concerning the extensions of an absolute value on K to a 
finite extension E will comes form an embedding of E into the al- 
gebraic closure of the completion of K. 


§1. Definitions, dependence and independence 


Let K be a field. An absolute value on K is a real valued func- 
tion x— v(x) on K satisfying the following three properties: 

AV 1. We have v(x) = 0, and v(x) = 0 if and only if x =0. 

AV 2. v(xy) = v(x)v(y) for all x, ye K. 

AV 3. Be + y¥) S V(x) + 019). 
If instead of AV 3, the absolute value satisfies the stronger condi- 
tion 

AV 4. v(*«+ y) S max [v(4), v(y)] 
then we shall say that it is a valuation, or that it is non-archimedean. 
In this chapter we emphasize absolute values, taking more or less 
for granted the elementary properties of valuations as given, for 
instance, in JAG [16], Chapter I. 

The absolute value which is such that v(x) =1 for all x #0 is 
called trivial. 

An absolute value on K defines a metric, and thus a topology. 
We shall therefore write |x|, or simply |x| instead of v(x). Two 
absolute values are called dependent if they define the same topology. 
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If they do not, they are called independent. 


Proposition 1. Two non-trivial absolute values, | |, and | |2, on @ 
field K, are dependent 1f and only tf the relation 
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implies |\x|o<1. If they are dependent, then there exists a real 
number 4 >0 such that |x|1=|x\|? for all x € K. 

Proof: If the two absolute values are dependent, then our con- 
dition is satisfied, because the set of x € K such that |x|, <1 is 
the same as the set such that imx” =0 for n-»o, Conversely, 
assume the condition satisfied. Then |x|, >1 implies |x|. >1 
since |x '|, <1. By hypothesis, there exists an element x, € K such 
that |x|: >1. Put a =|, and dD =| x |2, and let A= log(})/log (a). 
Let x ¢ K, and «+0. Then |x|:=|m lr for some a>0O. If 
m,n are integers > 0 such that m/n > a we have |x| < | %0 |” 
whence [| x"/x> |: <1, and thus | x"/x’|2<1, or in other words, 
|xle<|xol"/". Similarly, |x|. >|%|3/". Hence |x|,=|x|?. The 
assertion of our proposition is now obvious. 

One of the main applications of the theory of absolute values 
will be to the rational numbers Q and its finite extensions (which 
are called number fields). It is easy to show that a non-trivial 
absolute value on Q is dependent on one of the following: 

The ordinary absolute value. 

The p-adic absolute value, v» =| |p, defined for each prime num- 
ber p by the formula 


| p’m/n |p = 1/p" 


where +7 is an integer, and m,z are integers + 0 and not divisible 
by p. 

In fact, if the absolute value on Q is a valuation, then looking 
at its valuation ring 9 and maximal ideal m one sees immediately 
that mn Z is a prime ideal, and thus is generated by a prime num- 
ber p. Thus by Proposition 1 our absolute value is a power of the 
p-adic absolute value (cf. IAG Chapter I). 

Suppose that we have an absolute value on a field which is 
bounded on the prime ring (i.e. the integers Z in characteristic 0, 
or the integers mod p in characteristic p). Then it is non-archime- 
dean. One sees this immediately by looking at the expansion of 
(x+y)” and |(«+ y)")|"" as n>. This is always the case in 
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characteristic > 0. 

If the absolute value is archimedean, then we refer the reader 
to [3] or [39] for a proof that it is dependent on the ordinary ab- 
solute value. We do not need this fact in our applications, for we 
always will start with a concretely given set of absolute values on 
fields which will interest us. 

The relation vf dependence on absolute values is obviously an 
equivalence relation (symmetric, reflexive, transitive). An equiva- 
lence class of non-trivial absolute values on a field K will be called 
a prime of K, and will sometimes be denoted by a German letter 
p, or if we deal with an extension of K, by ¥%. If E is an exten- 
sion of K, and pa prime of K, and $a prime of FE, we say that 
% extends p and write {|p if the absolute values in $ when rest- 
ricted to K give absolute values of yp. Similarly, if w is an absolute 
value on EF extending an absolute value on K, we write w|v. We 
shall study in §4 the manner in which an absolute value on K can 
be extended to algebraic extensions of K. 

For the convenience of the reader, we recall some terminology. 
A prime p» is called xon-archimedean if its absolute values are non- 
archimedean. If this is the case, then it belongs to a valuation 
ring 0 = 0, with maximal idealm=m,. Let E be a finite extension 
of K, and let be a prime extending p. It corresponds to a valua- 
tion ring lying above o. Let og be this valuation ring, and mz its 
maximal ideal. Then the residue class field oz/mg is an extension 
of o/m, and its degree is called the residue class degree, denoted by 
dg, or dy if w is one of the valuations of 3%. The value group 
w(K*) is a subgroup of w(E*), whose index is called the ramifica- 
tion index and denoted by eg or e,. They are both finite (cf. IAG 
Chapter I). 

In our applications we shall deal mostly with archimedean abso- 
lute values, or with discrete valuations. Let us say a few words 
about the latter. 

Suppose we have a field A with a valuation ring 0 and maximal 
ideal p. In general, we obtain a valuation whose canonical value 
group is the factor group K */U where U are the units of o (cf. 
IAG, Chapter I). Suppose this value group is infinite cyclic. It 
can then be embedded into the positive reals, preserving the order- 
ing. We select a canonical generator for it (from two possible 
ones) namely, || where z lies in p. Then every element of K i 
can be written in the form 
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where u is a unit, 7 an integer, which is called the order of x at 
the valuation (or prime). In particular, ) is principal, generated by 
zx. Using the same letter p for the maximal ideal of 0, or for the 
equivalence class of absolute values (valuations) which correspond 
to it, we write 


r=ord x =ordy)x =ord.x. 


Using Proposition 1, we see that all the absolute values corre- 
sponding to 0 can be described as follows. We choose a number c 
¢ such that 0 < ¢ < 1, and put 


|x| = cords | 


Such absolute values, or valuations, will be called discrete. 

There is of course considerable arbitrariness in the choise of the 
constant c, and there is usually no way of choosing it in a natural 
manner. However, in some applications, the residue class field o/p 
is a finite field. In that case, denoting the number of elements in 
it by Np, we choose c = 1/Np, and our absolute value becomes 


II 2 [ly = (LIND) 9? 


This will be the case in one of the most important applications, 
namely, number fields, whose non-trivial valuation rings all have 
a finite residue class field. We shall return to this in greater de- 
tail in Chapter II. 

In Proposition 1 we derived a strong condition on dependent 
absolute values. We shall now derive a condition on independent 
ones. 


TueoreM 1. Let K be a field, and | \;,-+-,| |. non-trivial pair- 
wise independent absolute values on K. Let x,,---, x, be elements of 
K, and <e>0. Then there exists x¢K such that 


| ee cere 


fOr Gil 4. 

Proof: Consider first two of our absolute values, say v; and 2,. 
By hypothesis, we can find a€ K such that |@|,; <1 and j|a@a|,=1. 
Similarly, we can find Be K such that | 8|, =1 and leily < il, IPE 
y= Bla. Thenmlg |, > 1 and’ \igal, < 1. 
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We shall now prove that there exists z¢ K such that |z|, > 1 and 
|z|; <1 forj =2,---,s. We prove this by induction, the case s=2 
having just been proved. Suppose we have found z¢€K satisfying 

ge a aes | and |z|;<1 for jg—z,--:,s—L, 


If |2|,;<1, then the element z"y for large n will satisfy our re- 
quirements. 
If |z|, >1, then the sequence 


=r (leet 2™) 


tends to 1 at v, and v,, and tends to 0 at v; (7 = 2,---,s—1). For 
large n, it is then clear that t,y satisfies our requirements. 

Using the element z that we have just constructed, we see that 
the sequence z"/(1 + 2”) tends to 1 at v, and to 0 at v; for 7 = 2,---,s 
For each z = 1, ---, s we can therefore construct an element z; which 
is very close to 1 at v; and very close to 0 at v; (7 #17). The ele- 
ment 


G6 = PAR) Ge ORD ae ee 


then satisfies the requirement of the theorem. 


§2. Completions 


Let K be a field with a non-trivial absolute value v, which will 
remain fixed throughout this section. One can then define in the 
usual manner the notion of Cauchy sequence, and K is said to be 
complete if every Cauchy sequence converges. 

Proposition 2. There exists a pair (Ky, 1) consisting of a field K, 
complete under an absolute value, and an embedding 1: K— K, such 
that the absolute value on K is induced by that of K, (i.e. |x|. = 
|ix |), and such that iK is dense in K,. If (K},i') is another such 
pair, then there exists a unique isomorphism ¢: K,— Ky, preserving 
the absolute values, and making the following diagram commutative: 


Ke ee 


NA 


The uniqueness is obvious. ao proves the existence in the well 
known manner: One considers the Cauchy sequences in A, which 
form a ring. The null sequences form a maximal ideal, and the 
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residue class ring is a field K,. Our field K is naturally embedded 
in K,(by means of the sequences whose elements consist of a fixed 
element of K), and the absolute value on K can be extended to A, 
by continuity. 

One usually identifies K inside K,, and calls K, the completion of 
IG 

If the absolute value is archimedean (and thus K has characte- 
ristic 0), then it is well known that K, is either the field of real 
numbers or of complex numbers, and that the absolue value is de- 
pendent on the ordinary one. For a proof, the reader may see [3]. 
We shall not need this result in this book, and hence we shall al- 
ways assume that the archimedean absolute values we deal with 
have this property. 

As the completion depends only on the prime determined by our 
absolute value, we also denote it by Ky. 

If the absolute value is a valuation, then we refer the reader to 
IAG, Chapter I, for the proof that it can be extended in some way 
to the algebraic closure of K. 


Proposition 3. Let K be complete with respect to a non-trivial 
absolute value v. If E is any algebraic extension of K, then v has a 
unique extension to E. If E is finite over K, then E is complete. 

Proof: We give the proof only in the non-archimedean case, the 
other being trivial once one knows that the only complete archime- 
dean fields are the reals or complex. Without loss of generality, 
we may assume F finite over K. Let us prove the uniqueness. In 
view of Proposition 1, it suffices to prove that two extensions de- 


fine the same topology. Furthermore, if ,, -+-,w, are linearly in 
dependent elements of E over K, and 
(1) 2” = EM oye «+ cee ag EL 6 K 


is a Cauchy sequence in £, then it will suffice to prove that the 
nm sequences £;” converge in K. We do this by induction on n. 
It is obvious for ~=1. Assume m= 2. We consider a sequence 
as above and, without loss of generality, we may assume that it 
converges to 0. (If necessary, consider x” —x™ for v, 4— 0.) 
We must then show that the sequences of the coefficients converge 
to 0 also. If this is not the case, then there exists a number a>0 
such that we have for some j, say j = 1, 


(¥) 


l€i | >a 
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for all » sufficiently large. Then x” /€;” converges to 0, and we 
can write 


™ A ~ 

gor — 1 = ear O2 + ore F gor On. 

We let y™ be the right-hand side of this equation. Then y™ 
yields a sequence which converges (according to the left-hand side 
of the equation). By induction, we conclude that its coefficients 
in terms of 2, -+-,@, also converge in K, say to 72,°°:, 7%. Tak- 
ing the limit, we get 


@, = J2W2 + +++ + YnWn 


contradicting the independence of the o,;. 

We have thus proved that the extension of our valuation is uni- 
que. Furthermore, we have seen that if E is finite over K, of 
degree v, then it has the topology of Cartesian z-space over K: in 
particular, it is complete. The Cauchy sequences of the coefficients 
in (1) above converge in K, say to &,---,&; then our sequence 
x” converges to &,a, +--+ + Enon. 

From the uniqueness, we can get an explicit determination of 
the absolute value on an algebraic extension. Indeed, observe first 
that if E is a normal extension of AK, and o is an automorphism 
of E over K, then the function 


x— | ox | 
is an absolute value on E extending that of K. Hence we get 
[poe imeaselbx'| . 
If E is algebraic over K, and o is an isomorphism of E over K, 
then the same conclusion remains valid, as one sees immediately 
by embedding E in a normal extension of K. (We assume, without 
loss of generality, that all our algebraic extensions are contained 
in a fixed algebraic closure of K.) In particular, if a is algebraic 
over K, of degree n, and if a,-+--,a@, are its conjugates (counting 


multiplicities), then all the absolute values | a; | are equal. Denoting 
by N the norm from K(a) to K, we see that 


| Ma)|=|a|", 
and taking the m-th rvot, we get: 


Proposition 4. Let K be complete with respect to a non-trivial 
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absolute value. Let a be algebraic over K, and let N be the norm 
from K(a) to K. Let n=[K(a): K). Then 


|a| =| Ma) |". 


Of course, the arguments we gave above hold when we deal with 
a valuation, but in the archimedean case the proposition is also 
true, taking into account that we are dealing with the complex and 
real numbers. 

Let us now consider the case of a discrete valuation. We assume 
that K is complete under a discrete valuation, with valuation ring 
d, and maximal ideal m, and we let z bea prime element of o. If 
x,y €K* are such that |x| =|¥y|, we shall write x~y. Let 2; (= 
1,2, ---) be a sequence of elements of 0 such that z;~7z*. If x€o 
we denote by x’ its canonical image in 0/m. Let R bea set of rep- 
resentatives of o/m ino. This means that the map x— x’ of R 
into o/m is bijective. Then every element of 0 can be written as 
a convergent series 


X=Ao + Q1%1 + Ame + -:s 


with a;€R, and the a; uniquely determined. This is easily seen 
by a recursive argument, namely, suppose we have written 


X=Aot++++ + ant, (mod mm’) 


then x — (ao + *++ Qn%n) = Tasty With some ye€o. But by hypothesis, 
we can write y = @n41 + mz With some @nii¢ R. From this we get 


X=Aot +++ + Gntimar1 (modm"*’), 


and it is clear that the m-th term in our series tends to 0. Thus 
our series converges. The fact that R contains precisely one rep- 
resentative for each residue class mod m implies that the a; .are 
uniquely determined. 

Let E be a finite extension of K, let oz, mg be the valuation ring 
and maximal ideal in E lying above 0, m in K, and let 7 be a prime 
element in E. If ly and/x are the value groups of the valuation 
in E and K respectively, let e=(2:I'c). It is the ramification 
index. We then have 

|m |= |x|, 
and the elements 
OSise-1 
G =O 2 
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have order je +7 in E. 

Let ,,-:-,@« be elements of E such that their residue classes 
wi, --+, 4 in E’ =ozg/mz form a basis for E’ over K’. If R is as 
above a set of representatives of o/m in o, then 

ATOR + Laat! + AiWa , 
as the a, range over R, form a system of representatives of 0z/mz 
by definition. From this, one sees that every element of 9; admits 
a convergent expansion 

e—1 d oo ae F 

> > Diol 
4=0 v=1 7-0 
which shows that the ed elements w,i* form a set of generators 
of 0g over 0. On the other hand, one sees easily that they are 
linearly independent over K, and hence we get: 


Proposition 5. Let K be complete with respect to a discrete valua- 
tion. Let E be a finite extension of K, with ramification index e and 
residue class degree d. Then 

C0 | |Fke|, 

If we now combine the interpretation of the degree given in Pro- 
position 5 for discrete valuations with the expression given in 
Proposition 4 we get: 

Proposition 6. Let K be complete with respect to a discrete valua- 
tion and let » be its prime. Let E bea finite extension of K, $B the 
prime extending », and dg the degree of the residue class field ex- 
tension. Then for a¢ E*, we have 


ord, N x(a) = dgordga . 
Proof: This is immediate from the formula 
| N Xa) | = | a RAB 
and the definitions. 


§3. Unramified extensions 


Let K be a field, 0 a valuation ring of K, and m its maximal 
ideal. We let K’ =0/m be the residue class field, and we let g be 
the canonical K’-valued placed on K. We assume that @ is ex- 
tended in a fixed way to the algebraic closure K of K. and call ¢ 


this. extension. 
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Let E be an algebraic extension of K, and og one of the valuation 
rings of E lying above 0, with mz as maximal ideal. If £ is finite 
over K, then 0z/mzg is finite over o/m, and thus ¢ is K'-valued. We 
frequently identify g(oz) with oz/mz, and denote it by £’. 

Let v be a valuation associated with 0, and w an extension of v 
to E associated with oz. We shall say that w is unramified over 
v, or simply unramified, if the residue class field extension E’ over 
K' is separable, and if the ramification index is equal to 1, i.e. if 


(w(E*): (0K) =1. 


If every w extending v is unramifed we say that v is unramified. 

The purpose of this section is to study unramified extensions 
when K is complete. However, we may just as well make a weaker 
assumption: 

Throughout the rest of this section, we shall assume that for each 
algebraic extension E of K, there exists precisely one valuation ring 
of E lying above 0. 

In view of our assumption, we know that if ~ is another place 
of K having the same restriction as g on K, then ¢ and ¢ are equi- 
valent, i.e. there exists an automorphism o’ of K’ over K’' such 
that 

gp=o'¢. 

If f(X) is a polynomial in K[X] whose coefficients are finite under 

gy, then we let f* be the polynomial obtained by applying ¢ to its 


coefficients. Suppose f has leading coefficient 1 and that its coeffi- 
cients are integral over o. If 


d 
K(X) = TI X — ai) 
is a factorization in K, then 


CO = [eee 


41 
is a factorization of f® in the algebraic closure of K’. 

If f’(X) is a polynomial in E’[X] then any polynomial Ff in oz[X] 
of the same degree as f’, with leading coefficient 1 if f’ has lead- 
ing coefficient 1, and such that f° = f’ will be said to be lifted back 
from f' into E. It is clearly always possible to lift back a poly- 
nomial. 
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The above remark concerning the factorization of f shows that 
g maps K onto the algebraic closure of K’. 

Our assumption concerning the uniqueness of the extension of 
the valuation ring yields the following result. 


Proposition 7. Let f be a polynomial in o[X] such that f is ir- 
reducible in K[X] and has leading coefficient 1. Then }® is the power 
of an irreducible polynomial in K'[X}. 

Proof: Suppose f* = g’h’ where g’,h' are relatively prime of de- 
grees 21. Suppose we have a factorization as above, and say ga, 
is a root of g’ and ga, isa root of hk’. There exists an isomorphism 
o of K(a,) over K such that oa; = a, because f is irreducible, and 
we can extend o to K. Thus goa, = ga;. But go is a place coin- 
ciding with ¢ on K, and thus must be conjugate to ¢, i.e. there 
exists an isomorphism o’ of K’ over K’ such that go = 0’y, and 
we must have o’ga, = ga,. This contradicts the fact that g’ and A’ 
are relatively prime, and proves that f® is the power of an irreduci- 
ble polynomial. 

Our purpose is now to establish a bijective correspondence be- 
tween certain extensions of K and separable extensions of K’. We 
shall say that a finite extension E of K is unramified if it satisfies 
the following properties: 

UN 1. The residue class field E’ is separable over K’. 

MN 2. We have [(E:K]=[E£': K'). 

Another way of formulating these conditions is: 


Proposition 8. Let E be a finite extension of K, and let g be the 
canonical place of K. Let [E:K]=n. Then E is unramified over 
K if and only if @ has at least n distinct extensions to places of E 
(in the given algebraic closure of K'), and in that case, it has exactly n. 

Proof: By uniqueness, all extensions of g to E are conjugate, 
and the number of conjugates is equal to the separable degree of 
E’ over K’. As [E’: K'| S [E: K] our assertion is immediate. 

We observe that if E is unramified over K, then the ramification 
index is equal to 1. In the case of greatest interest to us, this is 
characteristic. 


Proposition 9. Assume K complete under a discrete valuation. 
Then the finite extension E of K is unramified if and only if E 48 
separable over K' and the ramification index is equal to 1. 

Proof: This follows immediately from Proposition 5. 
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An arbitrary algebraic extension of K is unramified if every 
finite subextension is unramified. 
We shall now obtain an algebraic criterion for non-ramification. 


Proposition 10. Let E be finite over K. If E is unramified, let 
a’ CE’ be such that E' = K'(a’), and let a€ E be such that ga=a’'. 
Then E = K(a), and the irreducible polynomial f(X) of a over K is 
such that f° is irreducible. Conversely, if K = K(a) for some integer 
a satisfying a polynomial f(X) in o[X] having leading coefficient 1 
and such that f° has no multiple root, then E is unramified over K 
and E’ = K'(¢a). 

Proof: First assume E unramified. Let f’(X) be the irreducible 
polynomial of a’ over K’. Let abe an element of E such that 
ya =a’, and let f(X) be its irreducible polynomial over K. Then 
a is integral over 0, and a’ is a root of f%, and hence f’ divides 
f°. On the other hand, 


deo /’ = |F 2K 1 =(2  iedegy 


and so f’ =f® This proves the first statement. 

Conversely, if @ satisfies the stated condition, then we may as- 
sume without loss of generality that its irreducible polynomial g(X ) 
is such that g? has no multiple roots, because g divides f. We can 
now simply apply Proposition 7, to conclude that g® is irreducible. 
Using the inequalities 


[A\Ge): KK} Sle: K aie: Kk) 


we now conclude that we must have an equality everywhere, and 
that E’ = K’'(ga). This proves our proposition. 

Next, we give the formalism for unramified extensions under 
various operations. 


Proposition 11. Let E be a finite extension of K. 

(i) If E> F>2 kK, then E is unramified over K if and only if ' 
E is unramified over F and F unramified over K. 

(ii) If E is unramified over K, and K, is a finite extension of K, 
then EK, is unramified over K,. 

(iii) Jf FE, and E, are finite unramified over K, then so is E,Eb. 

Proof; T he -first assertion comes from the inequalities 


Ie 1S 2 Ke [EF )S{E:F), (PF: ei 


together with their multiplicativity property in towers. One must 
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also use the fact that assertion (i) holds when ‘‘unramified”’ is re- 
placed by ‘‘a finite separable extension’’. The second assertion is 
an immediate consequence of our criterion in Proposition 10. The 
third comes formally from the first and second. 

Let us denote by vE the image under ¢ of the valuation ring 
dz, for any finite extension E of K. 


Proposition 12. The map E- gE gives a bijection of the finite 
unramified extensions of K and the finite separable extensions of 
K'=oK. 

Proof: We have shown in Proposition 10 that every finite separa- 
ble extension of K’ is obtainable as an image ¢vE, where E is un- 
ramified. We now must show uniqueness. If FE, c EF, are unramified, 
then clearly gE, C gE,. It will therefore suffice to prove that if 
E,, FE; are two unramified extensions of K then ¢(£,FE2) = ¢vE,:¢E. 
To do this, we can write £, = K(a,) and E, = K(a,) where a; and 
a, respectively satisfy the properties expressed in Proposition 10. 
Then £,E, = K(a, a), and we observe that with respect to the in- 
termediate field F,, a, satisfies a polynomial f(X) such that f* has 
no multiple root. Hence (E,E,)’ = Ei(ga,) = K'(ga:, gaz) which pro- 
ves Proposition 12, 


§4. Finite extensions 


Throughout this section, we shall deal witha field K having a non- 
trivial absolute value v. We wish to describe how this absolute 
value extends to finite extensions of K. 

If we let K, be the completion, we know that v can be extended 
to K,, and then uniquely to its algebraic closure K,. If Eis a finite 
extension of K, or even an algebraic one, then we can extend v 
to E by embedding E in K, by an isomorphism over K, and taking 
the induced absolute value on E. We shall now prove that every 
extension of v can be obtained in this manner. 


Proposition 13. Let E be a finite extension of K. Let w be an ab- 
solute value on E extending v, and let E., be the completion. Let 
K,, be the closure of K in E, and identify E in Ey. Then E,=EK. 
(the composite field). 

Proof: We observe that K, is a completion of A, and that the 
composite field EK, is algebraic over K, and therefore complete 
by §2, Proposition 3. Since it contains E, it follows that FE is dense 
in it, and hence that FE, = EKy. 
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If we start with an embedding o: E— K, (always assumed to be 
over K), then we know again by § 2, Proposition 3 that oE-K, is 
complete. Thus this construction and the construction of the pro- 
position are essentially the same, up to an isomorphism. In the 
future, we take the embedding point of view. We must now de- 
termine when two embeddings give us the same absolute value on 
EE 

Given two embeddings o,r: E— K,, we shall say that. they are 
conjugate over K, if there exists an automorphism 2 of K, over Ky 
such that o = Ar. We see that actually 4 is determined by its ef- 
fect on tE, or tE- Ky. 


Proposition 14. Let E be an algebraic extension of K. Two em- 
beddings o,t: E— K, give rise to the same absolute value on E if and 
only if they are conjugate over K,. 

Proof: Suppose they are conjugate over K,. Then the unique- 
ness of the extension of the absolute value from K, to K, guarantees 
that the induced absolute values on E are equal. Conversely, sup- 
pose this is the case. Let 4: rE ~cE be an isomorphism over K. 
We shall prove that 2 extends to an isomorphism of rE-K, onto 
oE-K, over K,. Since tE is dense intE&-K,, an element xe€rE-K, 
can be written 


59 = Ibi ese, 


with x,¢E. Since the absolute values induced by o and rt on E 
coincide, it follows that the sequence Arx, = ox, converges to an 
element of oE-K, which we denote by 4x. One then verifies im- 
mediately that 2x is independent of the particular sequence tx, used, 
and that the map 4: rE: K, ~ cE: K, is an isomorphism, which clearly 
leaves K, fixed. This proves our proposition. 

In view of the previous two propositions, if w is an extension of 
v to a finite extension E of K, then we may identity E, and a 
composite extension EK, of E and K,. If N=[E:K] is finite, then 
we shall call 


Ni > bere) 
the local degree. It is clear that 51,,,,[Ew: Ki] S$ [E: K). 


Proposition 15. Let E be a finite separable extension of K, of de- 
gree N. Then 


N=SNoe. 


wily 


{I, §4] FINITE EXTENSIONS 15 


Proof. We can write E = K(a) forasingle element a. Let f(X) 
be its irreducible polynomial over K. Then over K,, we have a 
decomposition 


feu fiien = f(x) 

into irreducible factors f,(X). They all appear with multiplicity 1 
according to our hypothesis of separability. The embeddings of E 
into K, correspond to the maps of a onto the roots of the fj. 
Two embeddings are conjugate if and only if they map a onto roots 
of the same polynomial f;. On the other hand, it is clear that the 
local degree in each case is precisely the degree of f;. This proves 
our proposition. 

There exist interesting cases in characteristic p for which the 
relation in the proposition holds without the hypothesis of separa- 
bility. This is notably the case for the discrete valuation arising 
from a simple subvariety of codimension 1 on an algebraic variety. 
For a sufficient condition under which the relation holds, including 
the special case just mentioned, the reader may see [3]. 

Whenever v is an absolute value on KA such that for any finite 
extension EF of K we have 

Per. a) — > [Ew K,] 
we shall say that wv is well behaved. Suppose we have a tower of 
finite extensions, L > E> K. Let w range over the absolute values 
of E extending v, and uw over those of L extending v. If u|w then 
L, contains E,. Thus we have: 


SD (Lu: Ko] = SS (La: Eu) (Ew: Ko) 


ulv w\y ulw 


= 3 (Ev: Ko] 3 (Lu: El 


why ulw 


< 3 (Ev: KJ[L: E] 


See re ee 
From this we immediately see that if v is well behaved, F finite 
over K, and w extends v on E, then w is well behaved (we must 
have an equality everywhere). 
Let E be a finite extension of K. Let p’ be its inseparable de- 
gree. We recall that the norm of an element a€K is given by the 
formula 


N(a) = J] oa?” 


16 ABSOLUTE VALUES [I, §4] 


where o ranges over all distinct isomorphisms of E over K (into a 
given algebraic closure). 

If w is an absolute value extending von £, then the norm from 
E,, to K, will be called the local norm. 

Replacing the above product by a sum, we get the trace, and 
the local trace. We abbreviate the trace by Tr. 


Proposition 16. Let E be a finite extension of K, and assume that 
v is well behaved. Let ac E. Then: 
N&@) = TT Nz%a) 


wiv 
Tria) = 3S Trz”(@) 
w\v 
Proof: Suppose first that E = K(a), and let f(X) be the irreduci- 
ble polynomial of a over K. If we factor f(X) into irreducible 
terms over K,, then 


TG) Fi) oe) 


where each /;(X) is irreducible, and the f; are distinct because of 
our hypothesis that v is well behaved. The norm N(a) is equal 
to (—1)**" times the constant term of f, and similarly for each fi. 
Since the constant term of f is equal to the product of the const- 
ant terms of the /;, we get the first part of the proposition. The 
statement for the trace follows by looking at the penultimate coef- 
ficient of f and each f;. 

If EF is not equal to K(a), then we simply use the transitivity of 
the norm and trace. We leave the details to the reader. 

Taking into account Proposition 4 of §2, we have: 


TuHeoreM 2. Let K have a well behaved absolute value v. Let E 
be a finite extension of K, and acE. Let 


Ng [eK | 
for each absolute w on E extending v. Then 
Pilei.? (Nia) |e. 
wv 
We conclude this section with some remarks on valuations which 
show how the ramification index and residue class degree remain 
the same when we pass to the completions. This will be especially 


useful in our applications to discrete valuations, taking into account 
Proposition 5 of § 2. 
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Suppose that we start with a field K with a non-trivial valuation 
v, corresponding valuation ring 0, and maximal ideal m. Let us 
denote for a moment by K the completion of K, and let 0 resp. in 
be the closure of 0 resp. m in K. Using the fact that if |x| +#|y| 
then | x + y| = max(| «|, ||) we see that 6 is a valuation ring of 
K, that th is its maximal ideal, and that 


inK=0, nee = 
Thus we have a canonical isomorphism 
o/m—o/m , 


If E is a finite extension of K, and oz, my a valuation ring and its 
maximal ideal lying above 0, m and if we denote by a * the com- 
pletion, then we have a canonical commutative diagram: 


tiie — + Deine 

olm —-> d/fh 
the vertical arrows being inclusions. Thus the residue class field 
extension can be studied either over the given field K or its com- 
pletion. 

Similarly for the ramification index: If x¢€K and ye K is such 
that |y— «|< |x|, then |y—x+<x|= {| and hence the inclusion 
map 

v(K *) > o(K*) 
is an isomorphism (the value groups are the same). If now FE is a 
finite extension of K, then we have the commutative diagram 


w(E*) —> w(E*) 


i. | 


uv K*) — o(K*) 
for any valuation w extending v to E, the vertical arrows being 
inclusions. Thus the ramification index can be studied locally over 
the completion. 


Proposition 17. Let v be a discrete, well behaved valuation on K, 
and let E be a finite extension of K. For each extension w of v to 
E, let 
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ew = (W(E*) : (K*)) 
Gz = On) iile =e Oy) Mel 
be the ramification index and residue class degree respectively. Then 
2 Cu a Paw « © [es 


If E is Galois over K, then all e, are equal to the same number e, 
all d, are equal to the same number d, and so 


[E: K] = edr 


where r is the number of extensions of v to E. 

Proof: Our first assertion comes from our assumption, and Pro- 
position 5 of §2. If E is Galois over K, we know that given two 
extensions w,, Ww, of v there exists an automorphism o of E over K 
such that w, = w.o. This symmetry implies our second assertion. 


Historical Note: 


The theory of absolute values is perfectly classical. We have, 
of course, not been exhaustive, giving only what we need most in 
the sequel. The reader wanting a more complete treatment may 
consult Bourbaki’s Commutative Algebra, or Artin’s Algebraic Num- 
bers and Algebraic Functions. Actually, our treatment is much in- 
fluenced by Artin’s. Furthermore, although many special cases of 
the approximation theorem were known before (Chinese remainder 
theorems), it was first stated and proved in full generality for ab- 
solute values by Artin-Whaples [2]. 


CHAPTER II 


The Product Formula 


We shall give at the end of §1 the three most important examples 
of the type of field which concerns us. They are essentially fields 
of finite type over the prime field or over some constant field, the 
latter giving rise to relative theories. 

On these fields, when we take suitable models of them, we get 
families of absolute values. These absolute values are either dis- 
crete or archimedean. A proper set of absolute values will be 
defined so as to axiomatize what is common to all of them, and to 
give us what we shall need later for certain applications, for in- 
stance, our study of height functions, or divisors and divisor classes. 

It is particularly important when our set of absolute values will 
satisfy a product formula, the classical case being that of number 
fields. We place special emphasis on this case, and shall go deeper 
into the study of the group of ideal classes and the unit group, 
giving the classical results, which are more than qualitative. (In 


| the case of units, they give the number of generators.) 


In considering the examples of finite type (Examples 2 and 3) 
we limit ourselves to qualitative results, to the effect that certain 
groups of units and ideal classes are of finite type also. The proofs 


are essentially elementary, except for one reference to the Mordell- 


Weil theorem of Chapter V. 


§1. Proper sets of absolute values 
Let K be a field. An absolute value v on K is said to be proper 
if it is non-trivial, well behaved, and if, K having characteristic 0, 
its restriction to Q is either trivial, the ordinary absolute value, or 


a p-adic absolute value v». 


A set Mx of absolute values on K is said to be proper if every 


absolute value in it is proper, if any two distinct absolute values 


| 


are independent, and if, given x ¢ K, x #0, there exists only a 


finite number of v¢ Mz such that |x|,#1. In particular, if Mx 
is proper, there can be only a finite number of archimedean absolute 


values in Mx. 


Ly 
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If E is an algebraic extension of K, we shall denote by Mz the 
set of absolute values on E extending some absolute value in Mx. 
If E is finite over K, then Mz, is proper if Mx is proper. (Its 
absolute values are well behaved by Chapter I, §4, and the other 
conditions are trivially verified.) 

Let Mz be a proper set of absolute values on K. For each 
ve Mg, let 4, be a real number >0. We shall say that Mx satisfies 
the product formula with multiplicities 4, if for each xe K, x #0, 
we have 

Ges ae 
vEM 
By assumption, there is only a finite number of terms in this 
product which are not equal to 1, so the product makes sense. We 
shall say that Mx satisfies the product formula if all 2, =1. When 
we deal with a fixed set of multiplicities 4,, then we write for 
convenience 


[| lo = | * [2° 


so that our product formula reads 


IiNe«l),=1. 
vEMe 
Suppose now that we have a field F with a proper set Mr of 
absolute values satisfying the product formula with multiplicities 
1. Let K be a finite extension of F, and let Mx be the set of 
absolute values on AK which extend the absolute values of Mr. 
Then Mz is also a proper set of absolute values on K. If vo ¢ Mp 
and ve Mx, with v|v, set N, =[Ky:F,,]. Then for any element 
acE, a+0, we get by Theorem 2 of Chapter I, §4 
1= T[ | NF) |v 


wEME 


=i Wee 


EME v\% 


=(ilaeis?. 


vEME 


This shows that Mz satisfies the product formula with multi- 
plicities N,. 


Example 1 


The classical example is that of the rational numbers Q. For 
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each prime number p we have the absolute value v, described in 
Chapter I, §1. The ordinary archimedean absolute value will be 
said to belong to the prime at infinity, and we shall also use p or 
p. to denote its prime. If / is a prime number, then 


1 if pis a prime number +/ 
\Z\p= 41/l if p is a prime number =/ 
1 if p is the prime at infinity . 


Thus the set Ma of all absolute values v, (including the one at 
infinity) satisfies the product formula, the above argument showing 
this for prime numbers of Z, the general case following by multi- 
plicativity. It is also clear that our set is proper (using Proposi- 
tion 15 of Chapter I, § 4). 

If AK is a number field, then the set Mx of absolute values ex- 
tending those of Ma will be called the canonical set. 

If J = Ix is the ring of integers of K (i.e. the integral closure of Z 
in K) then the valuation rings of K are in bijective correspondence 
with the prime ideals p of J. We shall always exclude the trivial one. 
Each valuation ring is of type J,, the local ring of J at p JAG, 
Theorem 4 of Chapter I, §6). The primes of K corresponding to 
these valuation rings will be called the finite primes of K. Those 
primes corresponding to the absolute values extending the ordinary 
absolute value on Q will be called the infinite primes, or primes at 
infinity. 

Let K be a number field, and v one of the absolute values ex- 
tending the ordinary absolute value on Q. Then K, is either the 
field of real numbers of the field of complex numbers. We then 
say that v is real or complex, accordingly, and use a similar ter- 
minology for its associated prime. The multiplicity 


N, = [K, ‘ Q,] 


is then 1 or 2 according as v is real or complex, and thus for 
x € K, we have 


Weaite = [ale * 


either |x| or |x|’. 

Let us now consider a finite prime corresponding to a prime ideal 
p in the ring of integers J of K. Then there is a unique prime 
number p lying in p and we denote by v, or | |, the absolute be- 
loning to p extending the p-adic absolute value on Q. Its multi- 
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plicity is 
Ny = [Kp : Qe] = epdy 
where é, and dy are the ramification index and residue class degree, 
respectively. Letting 0, and my be the valuation ring and maximal 
ideal corresponding to p, we see that the number of elements in 
Dy/my, which we denote by Np, is 
Np = pr, 
Since Dp/my is of degree dy over Z/pZ. Furthermore, let z be a 
prime element at p. Then by definition, 
[xP |p =|Plp =1/p. 
Hence 
| = |,p°» = (1/Np) . 
From this we get: 
Proposition 1. Let K be a number field, » a finite prime, Np the 


order of o,/my, and | | p the absolute value extending the p-adic absolute 
value on Q. Let || |lp =| ee Then for xe K, x #0, 


I] © Ilp = CL/Np)e* . 


As our set of absolute values in Mg is in bijective correspond- 
ence with the primes of A, we write our product formula in the 
form 


PL lige 


pete 


Example 2 


Aside from algebraic number fields, the most important example 
of a proper set of absolute values satisfying the product formula 
is that obtained from the subvarieties of codimension 1 on a pro- 
jective variety, non-singular in codimension 1. We shall now des- 
cribe this situation. 

Let k be a field, V a projective variety defined over k, and non- 
singular in codimension 1. A prime rational divisor p of V over k 
is a cycle of codimension 1, 


p=4>w’ 
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where W is subvariety of V, of codimension 1, defined over an 
algebraic extension of k, W° ranges over the distinct conjugates 
of W over k, and yw is its order of inseparability over k. Such a 
prime rational divisor (also called a prime divisor) determines its 
local ring in the function field k(V) of V over k, and this local 
ring is a discrete valuation ring. Thus we have the notion order 
at pof a function xe k(V), x #0. By the degree of p, noted deg », 
we shall mean the projective degree, i.e. the number of points of 
intersection with a generic linear variety of complementary dimen- 
sion in the given projective embedding. Similarly, we have the 
degree of an arbitrary divisor, i.e. the number of points of inter- 
section with a linear variety, generic over a field of definition for 
all its components. If xe k(V), x +0, and (x), (x), (x)» denote the 
divisor of x, its divisor of zeros, and its polar divisor respectively, 
then 


deg (x)o = deg (X)0 


and hence deg (x) = 0. 
Now let c be a number, 0 < c <1, and for each prime divisor p 
of V over k, let, for x +0, 


Vy (x) = | x I; = OFdy2) deg (p) ; 

For K=&(V), we then get a family Mz = M;.y) of proper absolute 
values, which satisfy the product formula in view of the remark 
we just made. This will be said to be the family associated with 
the model V of the field K. Once our number c is chosen, we have 
a bijective correspondence between prime divisiors and absolute 
values in M;.y,), and thus we shall also write our product formula 


TI |[*lbp=1. 
PEt ny) 
As we have multiplicities =1, we have || x ||) =| lp. 

By a function field K over the constant field k we shall always 
mean a finitely generated, regular extension of k. Each model V 
of K over k which is projective and non-singular in codimension 1 
gives rise to absolute values as above. This slightly restricts the 
notion of function fields as used in the classical theory of function 
fields in one variable, i.e. finitely generated extensions K of a 
field k which are of transcendence degree 1 and such K is alge- 
braically closed in k. For such fields, one also has a product for- 
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mula for the set of discrete valuations, trivial on k, such valuations 
being well behaved and hence proper. 


Example 3 


Let R be a Noetherian ring, which we assume to be integrally 
closed. If p is a minimal prime ideal of R not equal to 0, then 
the local ring Ry is a discrete valuation ring in the quotient field 
of R, and an element of R has only a finite number of zeros from 
among such p. Suitably normalizing the absolute values we get 
from such a ), in case they induce a non-trivial absolute value 
on the prime ring, we get a proper set of absolute values on the 
quotient field of R, which will be denoted by Mz, or Mr,x. 

By a variety scheme over a ring R, we shall mean a scheme V 
over spec (R) which is irreducible, of finite type, having no divisors 
of 0 in its local rings, and such that if k is the field of fractions 
of R, and V* the extension of V to the ground field k, then the 
field of rational functions on V“ is a regular extension of k. Then 
V* is essentially the same thing as a variety, defined over k. (For 
schemes, see [11].) 

A variety scheme V over R will be said to be zormal if each 
one of its local rings is integrally closed, and if R is integrally 
closed. 

Now let R be Noetherian, and V a variety scheme over R. 
Assume that FR is integrally closed, and V normal. Let & be the 
field of fractions of R, and K the function field of V, so that K 
contains . 

We then get, on the one hand, a family of discrete valuations 
on K which are trivial on k, corresponding to the prime divisors 
on V* over &. (If V is affine, a prime divisor in this case is deter- 
mined by a minimal non-zero prime ideal of the affine ring of V“ 
over k. It need not be absolutely simple, merely k-simple.) This 
set will be denoted by M;.r. 

We get however other discrete valuations. For each prime ideal 
p of R which is minimal not equal to 0, the local ring Ry is, aS we 
have already pointed out, a discrete valuation ring of k. The fiber of V 
above the point p in spec(R) is either empty, or consists of a finite 
number of components, each of which gives rise to a discrete valuation 
on K, corresponding to what we shall call a prime divisor on V lying 
above ». We denote by Me,p the set of such discrete valuations. 
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We let M, be the union of Mg,y and M;,,y. Then M, is a set 
of discrete valuations on K, and each non-zero element of K is a 
unit at all but a finite number of them. By suitably normalizing 
the absolute values corresponding to these valutions which induce 
non-trivial valuations on Q, we get a proper set of absolute values 
on K, also denoted by M,. It does not in general satisfy the pro- 
duct formula. 

We recall that in view of our assumptions on V, and R, all but 
a finite number or primes p¢€ Me are such that there is exactly one 
prime of Mz, lying above it, corresponding to a ‘‘non-degenerate’’ 
reduction of Vmod p. We shall denote by Mz.y the subset of Mary 
lying above such p in Mr, so that Mpg.y can be identified with a 
subset Mf of Me, differing from Mz by only a finite set of elements. 

In studying variety schemes above a ring, the case in which all 
objects are of finite type over the prime ring is particularly im- 
portant, and we shall frequently use the following result. 


Proposition 2. Let R be a ring finitely generated over the prime 
ring. Then there exists an element a+#0 in the prime ring such 
that the integral closure of R{1/a| (in its quotient field) is a finite 
module over R[{1/a]. 

Proof. The case of characteristic >0 is standard, since the prime 
ring is a field (Noether’s Normalization Theorem). The case of 
interest to us here is that of characteristic 0. Denote by Rg the 
algebra generated over Q by R. By the normalization theorem, 
there exist elements ?¢,,---,¢,¢R algebraically independent over Q 
such that Rg is integral over Q[t;,---,¢,]. Since FR is finitely 
generated over Z, there exists an element a¢Z, a#0 such that R 
is integral over Z[l/a,¢,,---,¢t,]. Since this latter ring is integrally 
closed, its integral closure in the quotient field of R is a finite 
module over it, say by IAG, Theorem 2 of Chapter V, §1. 

Remark. It is in fact true that the integral closure of RF itself 
is a finite module over R, but this theorem, due to Nagata, is very 
much more difficult to prove. 


§2. Divisors, ideals, and units 
Let K be a field with a proper set of absolute values Mg. By an 
M,-divisor one means a real valued function 
v — d(v) 
of the absolute values in Mx with the following properties: 
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(i) b(v)>0 for all v¢ Mg, and 0(v) = 1 for all but a finite number 
of the v’s. 

(ii) For each non-archimedean v¢ Mx, there exists ae K* such 
that d(v) = v(a). 

If ae K* then the function v(a) on Mx is an Mx-divisor, denoted 
by ds, and is called principal. 

We observe that the Mx-divisors are closed under the operations 
of sup and inf, and that they form a group under multiplication 
(componentwise). 

If E is an algebraic extension of K, then for each we My we 
extend an Mx-divisor } to Mz, by the formula 


d(w) = d(v) 


if w extends ve Meg. If E is finite over K, then this is an M,- 
divisor, and our extension maps the Mx-divisors injectively into 
the M,-divisors. By abuse of language, even if E is infinite, we 
sometimes say that our extension is an M,-divisor (although condi- 
tion (i) might be violated). 

Suppose that v is discrete. If z is a prime element at v, then 


d(v) = | z |e 


for some integer m, which is called the order of d at v, and ab- 
breviated ord,d, or ord,d if p is the prime associated with v. 

Let us denote by Mx the subset of Mx consisting of the non- 
archimedean absolute values. 

Suppose that all the absolute values Mx are discrete, and that 
for each M,-divisor we take its restriction to these non-archimedean 
absolute values. These restrictions will be called finite divisors, 
and they form a group which is clearly isomorphic in the natural 
manner with the free abelian group generated by the primes corre- 
sponding to the absolute values in Mg. This latter group will be 
called the group of Mx-ideals. Each element x¢ K, x #0 determines 
an M,-ideal denoted by (x), and called a principal Mg-ideal. The 
principal Mx-ideals form a group, and the factor group is called 
the group of M,-ideal classes. 

If a is an M,-ideal, then we can write 


oe di yp 
p 


with Ny € Z, and p ranging over the primes of Mg, all but a finite 
number of n., being 0. We write 
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If we K*, then ord, a = ord, (a), i.e. it is the order of its divisor 
at p. 
If two Mz-ideals a and 6 lie in the same ideal class, we write 


= ord,a K 


a~b. 


This is the notion of linear equivalence, in Example 2 of §1. An 
Mr,-ideal is said to be integral if ord, a > 0 for all p in Mg. 
By an Mx-unit we shall mean an element a¢€ K* such that 


|a|lo= 


for all non-archimedean absolute values v of Mz. The Mg-units 
obviously form a multiplicative group, which we denote by Uy,, 
Ug, or simply U. 

In Example 1 of §1, our group of Mx-ideals is isomorphic in the 
obvious manner with the classical group of ideals of the ring of 
integers (including the fractional ideals, of course). The Mx-units 
are simply the units of the number field in the classical sense. 

In Example 2 of §1, the group of M,-ideals is simply the group 
of divisors on the variety, rational over the constant field &. The 
units are the non-zero constants (since we assume the variety com- 
plete and non-singular in codimension 1). 

In Example 3, we have: 


Proposition 3. Let R be a Notherian, integrally closed ring. Let 
Mr be its set of primes as in Example 3 of §1. Then R* is the 
set of Mp-units of the field K of fractions of R. 

Proof. It is well known from commutative algebra that R= NR,, 
the intersection being taken over all pe Mer. If we K is an Mp- 
unit, then uw lies in each Ry since, in particular, it has no pole 
among the p’s. Hence it is in R, and since it has no zero, it must 
be invertible (Krull’s principal ideal theorem). The converse is 
clear. 

Starting with a proper set of absolute values Mx on a field K, 
we shall frequently let S be a finite subset, containing the archi- 
medean absolute values which may occur in Mg. It is customary 
to call S-units the (Mx — S)-units, i.e. those elements of K having 
absolute value 1 for all v¢ S. These form a multiplicative group 
Ks. We shall study in § 4 the qualitative effect of removing 
such a finite set from consideration. For number fields, we shall 
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give in §3 the number of generators of the S-units. 

Let us now consider a number field K with its canonical set of 
absolute values M,z. For each finite prime, we had defined Np to 
be the number of elements in the residue class field of its valua- 
tion ring. We extend this to Mg-ideals by linearity, so that 


Na = J] (Np)°"r" . 


It is clear that there is only a finite number of p’s such that Np 
lies below a given number, and hence that there is only a finite 
number of integral Mx-ideals a such that Na lies below a given 
number. 

We shall now prove that the M;-ideal classes form a finite group. 
We need a lemma. 


Lemma 1. Let I be the ring of integers of the number field K. 
Let a be an integral Mx-ideal, a= DLMyD. Then Na is the number 
of elements of the module 


IT 4,/myP 
(the product being taken over the finite primes, and all but a finite 
number of terms being trivial). 

Proof. The valuation ring I, is a discrete valuation ring, and 
its maximal ideal m,, generated by z, is principal. As a module 
over I,/My, 

r Tak 
i I,/r I, 
is isomorphic to /,/m, itself, under multiplication by x”, 7 being an 
integer 20. We have the chain of ideals 


2 e 
12 ah 2 wl +++ Dx’, 


and from this we see that the number of elements in the module 
I,/z"I, is equal to (Np)". By definition, this is equal to N(vp) for 
the M,-ideal rp. Our assertion is now obvious by linearity. 


Proposition 4. Let K be a number field, a¢ K*, and a=(a) its 
Mry-ideal. Then for each prime number p, we have 


ord, Na = ord, NQ(a) 
and consequently 
|Na| =| N@(@)| = Na 
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where | | is the ordinary absolute value. 
Proof. For each p| p, let d, be the residue class degree. By 
Proposition 16 of Chapter I, §4 we have 
NG@@ = TL NGK@) 
p 
and hence, using Proposition 6 of Chapter I, § 2, 


ord, NE(a) = Siord, N&%a) 
pl? 


= 314, ordy a. 
pip 
Our first assertion now follows by definition, and the second is 
obvious from the first. 


TuHeoreM 1. Let K be a number field. Then there is a constant 
C depending on K only, such that for any integral Mx-ideal a there 
exists an integral idealb~a such that Nb <= C, and the group of Mx- 
ideal classes is finite. 

Proof. Our second assertion is an immediate consequence of the 
first and of the fact that there are only a finite number of integral 
M,-ideals 6 with Nb< C. Let us prove our first assertion. 

Let J be the integers of K. It is a finite module over Z, which 


is a principal ideal ring, and thus has a basis a, ---, wy over Z. 
Let S be the set of elements of J of type 
QO; +++: + ayOn 


with a; integers, such that 
0<a;S(Na)/" +1. 


Then there are more than Na elements in S, and thus there are 
two distinct elements a, 8 in S such that a— 8 =€& will map into 
0 in the homomorphism 


T> Fie ** 
(referring to Lemma 2). It follows from this that (€) 2a, and that 
we can write 


€)=a+b 
with an M,-integral ideal 6. On the other hand, we estimate 


| N&@ | = [L | cor + +++ + ewe | 
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where 0 < c; < (Na)'/” +1, and we see that there is a constant C 
(depending on the maximum of the archimedean absolute values of 
the w;, and on AN) such that 


| NE(6)| < CNa. 


Using Proposition 4, we get No<C, and b~a by definition. This 
proves our theorem. 


§3. Unit theorem 


Let K be a field with a proper family Mx of absolute values. 
For each Mx-divisor } we shall denote by L(d) the set of elements 
xé€K such that for each v¢e Mx we have 


| 28 lin S XD) 


If a¢ K*, and if we denote by abd the product bed of the principal 
M,x-divisor a times d, then L(ad) and L(d) are in canonical bijection 
under the mapping 


x ax xeELd). 


We denote the number of elements of L(d) by 4(b). Then Aad) = 
Aid). If we think of d as prescribing the sides of a box, all but a 
finite number of which are 1, then 4(d) may be interpreted as the 
number of field elements in the box. 

We define |d{, = d(v), and when we have multiplicities N,, 


[| DI]. = d()*” . 
We define the KA-size, or size of d to be 
HDilx = JT Idle. 
ve M 
If Mx satisfies the product formula with mulitiplicities N,, then 
the size of } is the same as that of ad. This size may be inter- 
preted as giving the volume of our box. We shall now prove that 


in number fields, the number of elements in the box is approxi- 
mately equal to the volume. 


TueoreM 2. Let K be a number field with its canonical proper 
set of absolute values Mg. There exist two numbers c,,¢, >0 de- 
bending only on K, such that for any Mx-divisor d, we have 


C1 || 0 |lx < 40) S sup [1, ce || D |Ix] . 
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Proof. Suppose that for some archimedean absolute value v, in 
Myx, the completion K,, is complex. In this complex plane, con- 
sider the square centered at the origin, with sides of length 2d(v). 
Let m be an integer such that 


m<ko'*?*<m+1. 


Without loss of generality, we may assume m +0, and so m= 1. 
Cut up each side of the square into m equal parts, thus giving rise 
to m* small squares inside the big one. Our set L(b) is embedded 
inside the big square at K,,. Since it contains more than m’ ele- 
ments, there exist two distinct elements x, y¢L(bd) lying in the 
same small square. Hence for their difference, we get the estimate 


20(vo) 
eS lege 


If v is any other archimedean absolute value of Mx, we get 
|x —y |» S 2d(v) 
and if v is non-archimedan, then 
ja — yl oy) 
Taking the product, we get 
1= TT |x—y le? = 4" || 9 Ilx/m? 


vEM 
where s is the number of archimedean primes in Mg. Since 
(m + 1)? S$ 4m’ the inequality on the right follows immediately. 

If there is no complex absolute value in Mx, then we proceed 
in a similar manner, using a real one vo, and cut up the interval 
centered at the origin of length 20(v) into m equal pieces, giving 
rise to m small intervals, with 


m < AD eam +1. 


The rest of the argument follows in the same way 
Let us prove the other inequality. Let J, be the integers of K 
(i.e. the integral closure of Z in K) and let 1, -::, ww be a basis 
of Ix over Z, so that N=[K:Q]. Put 
65 = SUDs 4 |e |+ 


the sup being taken over the archimedean absolute values v in Mx 
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and over i. This is a number depending only on K. 

Let » be our given Mx-divisor. By the approximation theorem 
(Theorem 1 of Chapter 1, §1) there exists an element aéK™* such 
that 


Oy SS | aad | = Be 


for each archimedean absolute value v in Mx. We now select an 
element a@€Z, a #0, such that aad has absolute value <1 at all 
non-archimedean vé€ Mz, just by taking a highly divisible at a 
lot of prime numbers. In view of the fact that 4(d) and || 0||x do 
not change if we multiply > by an element of K*, we may there- 
fore assume, without loss of generality, that our Mx-divisor satisfies 
the inequalities 


Co \a@ los (2, Ss 2c le |» 


for some element a¢€Z, a> 0. 

We must exhibit elements of L(d). For this purpose, consider 
the set L of elements of Jx consisting of those which can be ex- 
pressed in the form 


Qo; + +++ + yon 


with a4;€Z, and 0<a;<a. Then our set L contains more than 
a™ elements. 

For each non-archimedean prime p in Mg, let v, = ordyd. Let 
Op, Mp be the discrete valuation ring and its maximal ideal, and con- 
sider the additive group 


TT 2p/mp? 
which has II(Np)’? elements. In the additive homomorphism of J; 


into this group, we look at the image of L. There will be a subset 
L’ of L with at least 


an 


[[(Nv)”? 


elements, all of which have the same image. Take one fixed element 
xéL', and let y range over L’. Then for each non-archimedean 
absolute value v in Mx we have 


ey ee 0) 


because ord,(x — y) 2 ordpyd. If v is archimedean, then 
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| =) |> = Cola =) 


by an obvious estimate. Thus our elements x — y lie in Lid). We 
have therefore proved that 


1 
N 

Abd) 2a yy : 
We observe finally that 


a = ff laiy*> ec: I] |i? 


VV 60 Vb Veo 


the product being taken over the archimedean absolute values, and 
c, being an obvious constant, while 


1/Np’p = || d ||. 


if v is the non-archimedean absolute value belonging to p. Taking 
the full product over all absolute values proves our other inequality. 
Let K be a number field, and S a finite subset of Mx containing 
the archimedean absolute values. Let s be the number of elements 
of S. Let Ks be the S-units, i.e. the elements x¢ K* such that 


Ilx|]h=1 vé€S. 
We map Kz into Euclidean s-space as follows. Let um, ---,v,; be the 
absolute values of S. Map 
x — (log || x |[1, +++, log || Il.) , 
and call this map 
log: Ks > R’. 


By the product formula, the image of As is contained in the 
hyperplane defined by the equation 


E+---+8=0, 


so that this image is at most (s—1)-dimensional. The following 
theorem describes this image. 


Tueorem 3. Let K be a number field, S a finite subset of Mx con- 
taining the archimedean absolute values, and s the number of elements 
of S. Let Ks be the S-units. Then log(Ks) is an (s — 1)-dimensional 
lattice. 

By saying that it is a lattice, we mean that it is a discrete sub- 
group of R’, and by saying that it is (s — 1)-dimensional, we mean 
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that the vector space generated by it is the entire hyperplane 
mentioned above. Thus in particular, it follows that log(Ks) is a 
free abelian group on s—1 generators. Since the kernel of our 
log mapping is clearly the set of roots of unity in K (its elements 
form a group, and have bounded absolute value, so form a finite 
group), we get as a corollary: 


Corotuary. Let K be a number field, S a finite subset of Mx 
containing the archimedean absolute values. Then Ks modulo the 
group of roots of unity in K is a free abelian group on s generators 
(s = number of elements of S). 

Let us now prove the theorem, which is known as the unit theorem. 

We begin by observing that in any bounded region of R* there 
exists only a finite number of elements of log(Ks). Indeed, if 
log (x) lies in such a region, then the absolute values of x and its 
conjugates must be bounded, and hence x can satisfy only a finite 
number of equations of degree <[K:Q] over Q, because the co- 
efficients of such equations are elementary symmetric functions 
of x and its conjugates. By a well known property of Euclidean 
space, whose proof we shall recall at the very end, it follows that 
log(Ks) is a discrete, finitely generated subgroup of R*. We must 
now show that it has dimension s — 1. 

For this purpose, we shall first prove that given an index 7, there 
exists a vector (&:, ---,&;,) in log(Ks) such that &; >0 and &; <0 
for 7 #1. We shall then prove that any s—1 such vectors are 
linearly independent over R. 

We need the following lemma: 


Lemma 2. Given v,¢€ Mx there exists a number c(vo) >0 such that 
for any Mx-divisor » there exists BE K* such that 


1 5 || 80 ||» S co) 


for allv =v, in Mx. 

Proof. Wet c, be the number of Theorem 2. Let c =1 if % is 
archimedean, and let co = Npo, if po is the prime of vp). Let d’ be 
an Mx-divisor which differs from d only at 2 (i.e. d/(v) = d(v) for 
v # Vv») and such that 


Vers ||| db’ ae Ss ley. 


If v) is archimedean, we can adjust the v,.-component as we please, 
in a continuous fashion. If v is discrete, the value group ranges 
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over powers of Np), and so we can also find the b’ subject to our 
condition. We set c(v) = co/c:. 

By Theorem 2, 4(d') > 1 and hence there exists an element a+0 
in L(b’), that is, ||@||,<{|d’ ||. for all ve Mg. We put 8 =1la. 


Then the inequality on the left in the lemma is clearly satisfied. 
For the inequality on the right, we have 


' [| Bd" IIx ’ ' 
OD ls eee = (1 BD le = 
|| BD" || TL I! 6’ lic S || BD |lx = []0 IIx 


wEY 
for allvue Mz. The product is taken over all we Mz, wv. Since 
d’ is like b except at vw, we have also proved the inequality on the 
right. 

Returning to the main proof, we note that if ve S, then the value 
group of v is infinite cyclic, generated by Np, and there is only a 
finite number of primes such that Np < c(v,). Consequently, there 
is a finite set of absolute values S’ > S such that for v ¢ S’, we 
have 


1 = || 6d ||. = {| B lls 


in the lemma. Furthermore, there is only a finite number of pos- 
sible sets of values 


[|Ad||=l1B|]l, veS’—S. 


Let #:,-:-,8,- realize this finite number. Given any d and any £8 
satisfying the relation in the lemma, there exists some £; such 
that 


Ballo = 118 lle 


for all ve S’—S. This implies that @ = Bju with some we Ks. 
Substituting in the lemma, we get 


|| 2d ||» S c(vo)/II Bi Ilo 


for all v+w in Mx. As the ,,---, 8, are fixed, we vary D so 
that div) = 1 for v € S and Dd(v) is very large for all ve S. Then 
|| z ||» is very small for allveSandv+#v. By the product formula 
|| ||,, must be large. This achieves the first of our objectives. 

(The reader will observe that the crucial point was the finiteness 
of the p with Np bounded. Otherwise, the argument is formal, 
and applies for instance to function fields in one variable over a 
finite constant field. We leave this as an exercise.) 
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As to the second, we have found 4, +++, %.-1¢€ As such that 
log m1 = (Eu, eis) 


log %.-1 = es, aries cee iee) 
and such that the matrix of signs of the &;; is as follows: 
+ eS 
= + =— — 
— 1 ees, + — 
Let Yi,°°:, Ys be the column vectors. We must show that the 


first s — 1 are linearly independent over R. Suppose that 
ay, ap aoe ap Gs a—-1 = 0 


not all the coefficients being 0. Say a, > 0 and a, 2a; for any j. 
Then looking at the sum just in the first row, we get 


0= aE oF QE 12 ap 900 SF PE ras 
2 Eu + QE. + +++ + E151 
= aE, + Ere + oe+ + &i,5-1) 


because &,; is negative for 7 =2,---,s—1. By the product formula 
we must have &, + &i2 + +++ + &1,s-1 > 0, a contradiction. 

For the convenience of the reader, we repeat the proof that a 
discrete subgroup of Euclidean space is a free abelian group. We 
do this by induction on the dimension of the subgroup, i.e. the 
maximal number of linearly independent elements over R. 

Let © be our subgroup and &,, ---,&m a maximal set of independ- 
ent vectors in I’. Let 7, be the subgroup of I’ contained in the 
subspace spanned by &1,-+:,&m-1:. By induction, we may assume 
that any vector of 7) is a linear integral combination of &,, +--+, &m-y. 

Consider the subset © of all EET of the form 


E=abit +++ + anEm 
with real coefficients a; satisfying 


O'S Gen t=1,-:-,m—1 


VS¢g2e 


It is a bounded set. Let &, be a vector of & with the smallest 
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possible a, #0, say 
= hE, + See + OmEm 


Starting with any vector & of T, we can select integral coefficients 
C1, °°*,Cm in such a way that 


S =o = CnEm — CE Seep Oa Cn Cat 


lies in ©, and the coefficient of &, is less than b, and =0. This 
coefficient must therefore be 0, and &’ lies in 7. From this our 
result is clear. 


§4. Relative units and ideal classes 


The results of this section will not be used anywhere else in this 
book, except in Theorem:’8 of Chapter VII, § 4. 

Let K be a field with a proper set of discrete valuations M. We 
denote by Uy, Du, Cy the associated group of units, divisors (or 
ideals, same thing in this case), and divisor classes. We then have 
by definition, an exact sequence 


0- Uy > K* > Dy > Cy > 0. 
Let M’ be a subset of M. Then we have the obvious surjection 
Dy > Dy > 0 
whose kernel is Dy—y. Hence we have the following exact and 
commutative diagram: 


0 


J 
Dy-a 
i) 
0—- Uy ~ K*> Dy — Cu -0 (2) 
L 
> he Dy > Cy 0 


| 
0 


From this one gets in the usual manner two exact sequences: 


0— Uy Uy — Du-ur (3) 
Du-ur > Cy > Cyr > 0 (4) 


obtained by trivial chasing around the diagram, which we shall 
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leave to the reader. As an application, we get: 


Proposition 5. Suppose that M’' differs from M by a finite number 
of elements. Then Uy (resp. Cu) is finitely generated if and only 
if Uy (resp. Cy’) is finitely generated. 


Tueorem 4. Let k be a field, V a normal variety scheme over k, 
and M the set of discrete valuations of k(V) corresponding to the 
prime divisors of V rational over k. Let Uy be its set of units in 
k(V)*. Then Uxylk* is finitely generated. 

Proof. If Vis complete, then Uy, = k* and our assertion is trivial. 
In general, by considering an affine open subset of V whose set 
of prime rational divisors differs from that of V by a finite set, 
we may assume that V is affine. Then one may embed V ina 
projective k-normal variety V whose set of prime divisors over k 
again differs from that of V by a finite set. We can now use the 
exact sequence (3), modified as follows: 

> Ux/k* ae Uy |R* — Dy-w 
to conclude the proof. 


Corotuary. Let k be a field, K a finitely generated regular ex- 
tension of k, and S a subalgebra of K finitely generated as an alge- 
bra over k. Let S* be the invertible elements of S. Then S*/k* is 
a finitely generated group. 

Proof. Our algebra determines a variety, and by going to an 
affine open subset of this variety corresponding to a larger ring 
Si, we may assume that this variety is non-singular in codimen- 
sion 1. (Indeed, S*/k* is canonically contained in Sy/k*.) The in- 
vertible elements of S, coincide with the units for its set of prime 
divisors, and we can apply the theorem. 

Let R be a Noetherian ring, integrally closed, let V be a variety 
scheme over R and assume that V is normal. Using the notation 
of §1, let Mg=Mnry be the subset of Mz.y consisting of those 
elements of Mz having a unique extension to M,, and let My = 
Mz U M,.y where k is the quotient field of R. Then we have an 
exact and commutative diagram: 
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0 0 0 
1 J L 
Un, Un:, Upegpe sk” 
i} Ab L 
O>- k — K* > K*/k* 30 
i ' I (5) 
Oe De > Du, , —0 
U u 1 
Cup Cu, Cu, 
L L 1 
0 0 0 


and consequently an exact sequence 
Ux, car Ux: =? Ou, [k™ are, Cu, = Cur, a, oe (6) 


where the middle map is the usual coboundary operator (cf. [3]). 

If we now apply Theorem 4 to the first part of this sequence, 
together with the finite generation of the unit group in number 
fields, we get: 


THEOREM 5. Let V be a normal variety scheme over the ring of 
integers I of an algebraic number field k. Then the group of units 
Ux, of its field of rational functions is finitely generated. 


Corotuary. Let R be a finitely generated ring over Z. Then the 
invertible elements R* of R form a finitely generated group. 

Proof. Using Proposition 2 of §1, we may embed R in an 
integrally closed ring of finite type over Z, and it clearly suffices 
to prove our assertion in this case. Our ring then determines a 
variety scheme as in the theorem, and its invertible elements are 
equal to the M,-units, so that the theorem applies. 

It will be a comparatively deep theorem that the group Cy, , is 
finitely generated whenever & is a field of finite type over the prime 
field. Once this will be proved, however, (Chapter V) we shall 
see, from the end of the exact sequence (6) together with the finite- 
ness of class number theorem in number fields (Theorem 1 of § 2), 
that we can derive a similar statement for the divisor classes: 


Tueorem 6. Let V be a normal projective variety scheme over the 
ring of integers of a number field k. Let My be the set of discrete 
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valuations corresponding to its prime divisors, as in Example 3 of §1. 
Then the group Cu, of divisor classes is finitely generated. 

We apply sequence (6) using the set My which differs from My 
by a finite subset. This of course makes no difference to the finite 
generation, in view of sequence (4). 


Historical Note: 


It was Artin-Whaples [2] who first showed how the product 
formula can be taken as the basic axiom for algebraic number 
theory. For us here, it is irrelevant to know that if one of the 
primes is archimedean or discrete and its residue class field is finite 
over the constant field, then this characterizes the field as being 
either a number field or a function field in one variable. 

Divisors, including possibly archimedean primes, were defined by 
Weil [40], with the idea of making his decomposition theorem 
valid for them also. We shall reproduce this theorem in Chapter 
Vil. 

The theorem giving the approximate number of algebraic numbers 
(Theorem 2), as well as the proof of the unit theorem, has been 
taken from Artin-Whaples. In this respect, one must emphasize 
that the finiteness of the ideal class group, and the finite genera- 
tion of the unit group are utter trivialities. In the case of the 
units, this is mot the content of the unit theorem. Its content 
lies first in giving the precise rank, but even more in asserting 
that the log vectors, as in Theorem 3, are linearly independent 
over R which is saying much more. In some applications, it is 
necessary to have the unit theorem as we have given it. How- 
ever, in §4, or for applications to Chapter VI, where we are in- 
terested only in qualitative results, the finite generation would 
suffice. For the ideal class group, there is an analogue to the 
quantitative result which we have given here for the unit group, 
i.e. the more precise determination of the Minkowski constant of 
Theorem 1. We shall not go into this here. 

One should note that Theorem 2 can be improved to give an 
asymptotic estimate for the number of elements in K in L(bd), 
namely, we have 


Ad) = ex || Dlx + o(|| D IIx) 


for ||D||<—>0o. This is more or less known, but is not explicitly 
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in the literature. One reduces the proof to counting lattice points 
in parallelotopes in Euclidean N-space (where N=[K:Q]) by means 
of the finiteness of the class number and Lemma 2. 

One can then use a simple direct argument. We shall not go 
further into this here. 

The contents of §4 are taken from Roquette [30]. 

Finally we observe that there are product formulas arising from 
situations more general than that of Example 2. Indeed, let us 
take a complete, non-singular surface V and a curve C on V 
which has the following property: For every positive l-cycle X 
on V, the intersection number /(X.C) of the numerical equivalence 
classes of X and C is = 0. For every prime rational cycle p on 
V we define the multiplicity d(p)=J(p.C). If ¢ is a function on 
V, then we define 


\ai= e-4(p) ordy (¢) | 


The set of such absolute values does satisfy the product formula. 
If the curve is a generic hyperplane section, then we have our 
standard example. However, as Mumford pointed out to me, there 
exist examples of such curves which do not lie in a pencil, and 
thus give rise to other types of product formulas, to which Roth’s 
theorem would also apply. 


CHAPTER Ill 


Heights 


The possibility of defining the height of a point on a variety lies 
at the base of all possibilities of counting such points. In this 
book, this allows us to get qualitative results, to the effect that 
certain sets of points are finite, or, if they form a group, a finitely 
generated one. 

One can give the definition of the height just after the axiom 
that we have a field with a proper set of absolute values satisfying 
the product formula. We can also prove important geometric proper- 
ties of the height (how it varies under certain mappings) using 
only this axiom. The formalism we shall develop will then be used 
in the infinite descent of Chapter V, and in the counting of Siegel’s 
theorem in Chapter VII. 

In number fields, the set of points in a given projective space of 
bounded degree and bounded height is finite. We shall give in §3 
the analogue in function fields, where we prove that certain varie- 
ties lie in a finite number of algebraic families. 


§1. Definitions 


Let F be a field with a proper set of absolute values Mf, satis- 
fying the product formula. Then for each finite extension K of F, 
the set of absolute values on AK extending those of Mp is a proper 
set Mx, satisfying the product formula with multiplicities N, = 
[K,:F,] for ve Mz. As before, we set 


lx \leemnlie*  xeK. 
We now consider projective n-space P”, and let Pé Pz be a point 
in projective space, rational over K. Let (%,::+,%n) be a set of 


coordinates for P, with each x;¢ K. Then we define the height of 
P relative to Mx by the formula 


Hy,(P) = [I supi || x Ilo. 


vem 


In view of the product formula, one sees immediately that our 
product is independent of the choice of coordinates chosen: If 


43 
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yeékK* then 
sup; || 97 |ls = | esube rn ile 


and taking the product, the term involving y will be equal to 1. 

In particular, we define the height of an element x¢K to be the 
height of the point (1, x) in P’, so that we have 

Hu,(*) = TI supd, || x ll), 
vEME 
and we see that if x +0, then Hy,(x) = Hu,(1/x). 

For convenience, if the set Mp is fixed throughout, we write 
simply Hx or H, if the reference to K is also fixed throughout a 
discussion. 

Let us suppose that our set Mf is fixed. Let E > K be two finite 
extensions of F, and let (%,---,%,) be coordinates of a point Pin 
P", rational over K. Then one has 


H,(P) = Hx(P)***'. 
Indeed, letting N. =[E.:F.] for we Ma, we get 
HAP) = Tl saps (472° = Tet supra” 


wEM p VEM yp wiv 


On the other hand, 
lB j ES = [eas : KONE : F.| 


For all absolute values w inducing the same absolute value v on 
kK, the term [K,:F.] is constant, and we have 

>) 4a: Ay] = (2: K]. 

wv 
From this our assertion is obvious. 

The formula we have just proved shows how the height changes 

as we extend the field. It is therefore natural to define an absolute 
height of a point P, rational over the algebraic closure of F, by 


h(P) = hu(P) = Hy( PF 


for any finite extension AK of F over which P is rational. The 
argument we have just given shows that A(P) is independent of 
the choice of this field. 

Let o be an isomorphism of & over F (i.e. leaving F fixed), Let 
P be a point as above, with coordinates (x, ---+,x,) rational over 
K. Then we can define the point P’, rational over K’, and having 
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coordinates (x,---,x;). By transport of structure, we get im- 
mediately 


Ax(P) = Hgo(P”) 
whence in particular, 
RP) =hP”*) . 


Let us now consider an example. We take F =Q, the rational 
numbers. We can write any non-zero rational number in the form 
a/b where a, b are ralatively prime integers. If p is a prime num- 
ber then |a@|, and |5|, are <1, and one of them =1. Taking the 
canonical set of absolute values on Q to define the height, we see 
that 


HQ(a/b) = sup (|a@|,| 51). 
More generally, any point in projective m-space rational over Q has 


a set of coordinates (xo, - - - , x.) which are relatively prime integers, 
and we then see that 


Ho(P) = sup; | i | 


the absolute value being the ordinary one. In particular, the set 
of points in P” rational over Q, and of height < a fixed number 
is finite. We shall generalize this later to a number field. We 
agree from now on that the heights in number fields will always be 
those relative to the canonical set of absolute values on Q. 


§2. Gauss’ lemma 


Let K be a field with a non-trivial absolute value v. If f(X)= 
SavM.(X) is a polynomial in several variables with coefficients 
a,é K, and 


MUX) = Sa *, Xn) = >, CLEA ee, Gi 
a monomial, then we define 


Iflo =|f | =supy|ay|. 


Gauss’ Lemma for valuations then asserts that this is a valuation 
if v is a valuation: 


Proposition 1. Let K be a field with a non-trivial valuation. If 
f,g€K[X,-++, Xn] then 
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Proof: Let us first assume that f, g are polynomials in one 
variable, so we can write 


F(X) = agX* + +++ +a 
20) = 0X = 
Say a, is a coefficient of f such that |a,| 2 a; for all z, and is the 
one which lies furthest to the left that has this property. Similarly 
for b, with respect to the b;. Then 
(i/a,)f = (aala,)X* ie oe ae OG SP 000 =F Q/ay 
(1/b.)g = (b./bs).X ° Ss oe, Ole i bo/bs ’ 
and the coefficients of these polynomials each have a value <1. 


Those to the left of X” (resp. X°) have a value <l. If we take 
the product 


(1/a-b.)fg 


then we see that X’ X°* has a coefficient of type 1+ c¢ where 
|c| <1. The coefficient of every power X* with 1>r+s has a 
value <Jl, while if 7< 7+, then its value is <1. From this one 
sees that 


| (1/a,b.)fg | =1 


and hence that |fg|=|a,b,|, which proves our proposition for 
polynomials in one variable. 
Now let f be a polynomial in ” variables X,,---,Xn, of degree 
<d. Then the polynomial in one variable 
Sf) Qa Vi aeys =) 


has the same set of non-zero coefficients as f. Thus, if f and g are 


two polynomials in ” variables, such that the sum of their degrees 
is <d, then 


Sal fg) = Si(F)Salg) 


has the same non-zero coefficients as fg. From this our reduction 
of the -variable case to the 1-variable case is clear. 

For an archimedean absolute value, we do not get the full multi- 
plicativity, but we shall see that we get a modified version of it. 
Let us first consider the case of one variable. 
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Lemma 1. Let K be a field of characteristic 0, with an absolute 
value which coincides with the ordinary one on Q. Let f(X)&€K[X] 
be a polynomial of degree d, and let 


A(X) = IL ean 


be a factorization in K. We assume that our absolute value is extended 
to K. Then 


Il & d 
52 TI sup(, | a |) = |f| S$ 2° TJ supd, | a: |). 
ae i=1 


Proof: The right inequality is trivially proved by induction, 
estimating the coefficients in a product of a polynomial g(X) by 
(X — a). We prove the other by induction on the number of indices 
z such |a;|>2. If |a;| <2 for all 7, our assertion is obvious. 
Suppose now that f(X) = g(X)(X — a) with |a|>2 and suppose 
that our assertion is true for g(X) = X? + d)4.X*'+.---+h. We 
have 


FX) =X + Gar — @)X* + (baa — aba) XO" + + + +(—@)bo . 
We can assume |g| =|0;| 2 |0;-1| for some i, 0 <i <d (with the 
convention bg = 1,b-,=0). Then 
lf | 2 lab; —6-,| 2|a}|b;| —|o-1 | 
ee le | — |b; (=) a | — 1)| 83} 
= |a|/2-\b;|=|a@|/2-\¢| 
and our lemma is now obvious, since | a| > 2. 

Proposition 2. Let K be a field of characteristic 0, with an absolute 
value which coincides with the ordinary one on Q. Let d be an 
integer =0. Jf f and g are two polynomials in K(|X1,+-+, Xn] such 
_that deg f + deg g < d, then 


i lfelsifilels4" fel. 


Proof: Using the substitution Sz as before, we are reduced to 
the case of one variable with m=1. Say 


f(X) = a TI (X — a) 
g(X) = by] (X—8;). 


Without loss of generality, we may assume a = } = 1, and that we 
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have extended our absolute value to K. Using the lemma, we get 
lf Il g| S 2° J] supd, | a: |) T] supd., | 8; |) 
oo ye le 


The other inequality is proved in the same way. 
Let F again be a field with a proper set of absolute values Mp 
satisfying the product formula. We consider a polynomia 


F(X) = 3) aiMj(X) 


in several variables, with coefficients in F, and thus in a finite 
extension K of F. We define its absolute height h(f) to be the 
height A(P) of the point P having the a; (in any order) as coordi- 
nates, and we define its relative height Hx(f) in a similar way. 

From Proposition 2 we can now deduce analogous results for 
heights. 


Proposition 3. Let F be a field with a proper set of absolute values 
My satisfying the product formula, and let s be the number of archi- 
medean absolute values in My. Let f and g be two polynomials in 
n variables with coefficients in F, with deg f + deg g<d. Then 


a h( fg) S Wf )h(g) < 4h fe) . 


Proof: Let c, = 1/4 and c. = 4", Let K be a finite extension 


of F in which f and g have their coefficients. For ve Mx, put 
If llo=Ifl?*. We have 


Hx(fg)= Il fg lle Il WF lle lg ll» ew 


where c, = 1 if v is non-archimedean, and otherwise, c, = c’”. Let 
Sx be the set of archimedean absolute values in Mz. Then 
eS N, = [A F] Si 


vES 
whence 
— ,»KiFis 
Cy ae (é 
le =< 
and the inequality on the left follows immediately. The one on 
the right follows in a similar way. 


Let f(X) be a as above, with coefficients in a finite extension K 
of F. If ¢ is an isomorphism of K over F then we get the poly- 
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nomial 
f°(X) = SaMK(X), 
and thus, as for points, we have 


WT) = WF). 


Let @ be algebraic over F, and let f(X) be its irreducible poly- 
nomial over F. Then 


f(X) = TI (X— a) 


where d is the degree of a over F (i.e. [F(a): F]) and the a; are 
the conjugates of a, each repeated with a suitable multiplicity in 
characteristic p. In view of the above results we get: 


Proposition 4. Let F be a field with a proper set of absolute values 
satisfying the product formula. Let d be an integer >0. Then there 
exist two numbers c,, Cc, >0 depending on d, such that if « is algebraic 
over F of degree d, and f(X) is its irreducible polynomial over F, then 

cha)’ < h(f) S eho)’. 

Let us consider the special case where F = Q is the rational 
numbers. Then h(/) is the maximum of the absolute values of the 
coefficients of the irreducible polynomial of a over Z. More gener- 
ally, consider a point (a,---,@n) in projective space, rational over 
a finite extension K of Q, of degree d. Consider the polynomial 


F(X) = aX tees + anXn. 
Then h(/) is the height of our point. Let 
a(X) = If" 


the product being taken over all distinct isomorphisms o of K over 
Q. Then g has coefficients in Q, and h(g), h(f )* have the same order 
of magnitude. We have already seen in §1 that the number of 
points with height less than a fixed number, in a projective space, 
and rational over Q is bounded. Consequently we now get: 


TueoreM 1. Let do, ho be two fixed numbers >0. Then the set of 


points P in P” algebraic over Q, and such that 


[Q(P) : Q] < d and h(\P) <hy 


is finite. 
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We note the strong uniformity in the statement of the above 
proposition. Not only is the set of points in P” rational over a 
given number field and of bounded height finite, but so is the set 
of points of bounded degree and bounded height. Actully, in the 
final applications, this stronger uniformity will not be used, because 
we shall consider only points of varieties which are rational over 
a fixed number field (for instance in the Mordell-Weil theorem, or 
the Siegel theore:n). 


§ 3. Heights in function fields 


We shall consider the height function from a geometric point of 
view. 

Let W be a projective variety in P’, non-singular in codimension 
1, and defined over a field k. Let c be a number, 0<c<1. Let 
Mx = Miw) be the set of discrete absolute values of the functon 
field k(W) obtained from the prime rational divisors of W over R. 
We then have by definition 


|x lp = cOrdpz) deg(p) 


for each such prime divisor p, and our set Mg satisfies the product 
formula. If P is a point in P” rational over k(W) with coordinates 
(Yo, °° *,¥n) in R(W), then 


Hiw)(P) = Hw(P) = JJ supi| vi. 
p 


Proposition 5. Let d=degsup; (yi). be the projective degree in 
P” of the sup of the polar divisors of the y;. Then 


Hy(P) = (1/c)* . 
Proof: Without loss of generality, we may assume that one of 
the y; is equal to 1. In our product defining the height, we will 


have a contribution +1 only if p is a pole of one of the y;. For 
such a pole p we have 


sup; c Ordyy,) deg (p) — sup,(1/c)'-or4pya) deg (p) 
= (1/c)s4Ps(—ordpyg)deg(p) 


Our assertion is now obvious. 

We shall call d = deg sup,(y;). the logarithmic height, (or simply 
height if we work with it throughout a discussion), and we shall 
denote it by yw(P). It can also be interpreted in the following 
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manner. 

Proposition 6. Let W be a projective variety in P”, non-singular 
in codimension 1, defined over the field k, and let P be a point in 
projective space P", rational over k(W). Let f: WP” be the rational 
map defined over k, determined by P. Then 


nw(P) = deg f *(L) 


for any hyperplane L of P", such that f ‘(L) is defined, the degree 
being that in the given projective embedding of W in P”. 

Proof: Without loss of generality, we may assume none of the 
coordinates (yo,-++,yn) of Pis 0. Let (Yo,--+, Yn) be the varia- 
bles of P”. Each Y,/L(Y) determines a function ¢; on P", i.e. a 
rational map ¢;:P”— P’. We have two successive maps: 


WS Bs Pp 
and if L(y) #0 (i.e., f-'(L) is defined) then y,/L(y) determines the 


composition function g;of =~; of W into P’. By AV, Corollary 1 
of Proposition 2, Appendix, §1, we get 


@)=f (i)—f (L), 
where L; is the hyperplane Y;=0 of P”. Since the L; are without 


| common point, the divisors f-‘(L;) are without common component, 
| and hence 


supi(Pi)o = f a) : 


| This proves our proposition. 
We observe that our logarithmic height, and thus the height, 
are geometric, i.e. essentially do not depend on the field of defi- 
i nition of W. In particular, our point P being also rational over 
k(W), we have 


| Hiw)(P) = Ai (P) « 
Let W be as before, and let P be a point in P” rational over k(W). 
Then P defines a generically surjective rational map 
gow ST 
of W, such that if 7: T—P” is the inclusion, then jog=/f. If x 
-is a generic point of W over k, we have f(x) =g(x). This is a 


igeneric point of T over k. 
Since T is contained in P”, it has a degree in P” which we shall 
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also write deg T or deg,,7 if there is any chance of confusion 
with respect to the embedding. We wish to give a rough compari- 
son of yw(P) and deg T. To do this, we need some lemmas. 

We start therefore with a generically surjective rational map 
g:W-T defined over k. Let dim T=s, and let Lu,,--+-, Lu, be 
generic independent hyperplanes of P” over k. This means that 
ui, ***, Us, are independent generic over & (cf. IAG, Chapter VIII, 
§6). We also write L; instead of L.,, Then T-L,---L, isacycle 
on P” consisting of deg TJ points, which are all generic points of 
T over k, by IAG, Proposition 10 of Chapter VIII, §6. Furthermore, 
each one of these points is separable over k(u:,--+,u.), say by the 
criterion for multiplicity 1. 

If y is a generic point of T over & and y = f(x) = g(x), then g"“(y) 
is the prime rational cycle on W equal to the locus of x over k(y), 
taken with multiplicity 1 by F’-VII, Th. 12, i.e. it is the “generic 
fiber”. (It is not necessarily a variety, i.e. absolutely irreducible.) 

Lemma 2. Let W be a projective variety in P”™. Let g:W—->T°* 
be a generically surjective rational map defined over k. Let y be a 
generic point. of T over k, and let U=g™‘(y). Let 7: T—P" bean 
inclusion, and let L,---L, be generic independent hyperplanes of P” 
over k. Then 


(deg T)(deg U) = degg (T-L,---L,). 


Proof: Let T-L,-+--L,=>35(9;). Then deg T is the number 
of these points (y,), and the cycle on the right is precisely |} U; = 
dg '(yi). The U; are all conjugate over & (i.e. each one can be 
transformed into the other by an automorphism of the universal 
domain) and hence have the same degree in P”, namely the degree 
of U. Our assertion is therefore clear. 

Lemma 3. Let the notation be as in the previous lemma, and f = 
jog. Suppose that y is one of the points of T-L,-++Ls. Then 
each component of U =g™'(y) is a proper component of 


FUL) 02° 1 f 
and we have 
Foe IE Ly * ee L,) = 5) gad me 0 ES) : 

Proof: Let U, be a component of U and let Uy be a component 
of f-'(Li) Nn +++n f(L,) containing U, (which is itself contained 
in their intersection). Observe that f and g are both defined along 
U, (since a generic point for U, is also one for U over k(y) and is 
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a generic point for W over k). Hence f and g are defined along 
Us. Let x’ be a generic point of Us over the algebraic closure of 
R(u,,+ ++, Us) (notation as in the beginning of the discussion). Then 
f(x’) lies in TN Li --+AL, and hence is one of the points y of 
T-L,---L,. Since f(x’) = g(x’), it follows that x’ lies in supp (U), 
and thus is one of the components of U. Hence U{ itself is con- 
tained in this component, which must therefore be Up, and we get 
C—O 
To prove the second assertion, note that 


Pod) 7 a) 


is rational over k(w,+--,u,) and as we mentioned before, each 
point y in T-L,---L, is separable over this field. Since g~‘(y) is 
prime rational over A(w, ---,us)(y) (which is the composite of A(y) 


and a purely transcendental extension of k, free from k(y) over R), 
it follows that 


Spee (Le Fee 


If 1, ve are two distinct points y as above, then g~'(y,) has no 
component in common with g‘(y.). Hence we get 


Pen Lh.) = (L,)«- CE) 
as desired. 


Lemma 4. Let W be a projective variety in a projective space P”. 
Let X,,---,Xs be positive divisors on W. Then 


deg(X,-- +X.) S (deg Xi): - - (deg X,) . 


Proof: Let W and the X; be rational over an algebraically closed 
field k. We take a generic linear direction over k as in F’-I[X; such 
that the join of this linear variety with a divisor on V isa divisor 
on P”. We call this join also the projecting cone over the divisor, 
as usual. We recall the following facts. If A is a subvariety 
of W defined over k, and A the projecting cone over A, then deg 
A =deg A. If A and Bare subvarieties of W defined over k, and 
A +B, then A # B by F’-IXs, Proposition 6, Corollary 3._ If A is 
a proper components of X:---X; of multiplicity m, then A is also 
a proper component of X,---+X, of multiplicity m, by the corollary 
to Proposition 7 of F’-IX;, combined with F’-VIII,, Theorems 10 
and 11. The intersection of the X; is of course taken on P”. 
From these facts, it follows that 
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deg(X.- ++ X,) < deg(X- « - X,) 


and this reduces the proof to the case where W is equal to all of 
projective space. 
We then proceed by induction on s. By definition, we have 


CG as X,) = (Xi = 1) * Xs 


Thus it will suffice to prove our assertion for two factors, say X 
and Y, one of which is a divisor in P”. If every component of 
the intersection X  Y is proper, we can apply Bezout’s theorem. 
Otherwise, we proceed by induction on the sum of the multiplici- 
ties of the components of X and Y. If both X and Y are varieties. 
with X of codimension 1 in P”, then either X > Y in which case 
X:Y=0, or X > Y and Bezout’s theorem applies. In any case, 
our assertion is obvious. Say now X = X, + A where A is a variety, 
and xX, > 0.” Then 


X-Y=(4%,4+ A). Y=Xi-YrAyY. 
By induction, we get 
deg(X,- Y) + deg(A- Y) S (deg X,)\(deg Y) + (deg A)(deg Y) 
which proves the lemma. 


TuHEoREM 2. Let W" be a projective variety, non-singular in codi- 
mension 1 and defined over a field k. Let P be a point in P” rational 
over k(W), and let T be the locus of P over k, so that we have a 
generically surjective rational map g:W—>T. Then 


degp»T S nw(P)’. 


Proof: This follows immediately from the preceding lemmas, 
using X; = f~'(L,) and applying Proposition 6. 

In particular, we see that if P ranges over a set of points of 
bounded heights, then 7 has bounded degree in P”. 


§4. Heights on abelian groups 


This section is entirely different from the preceding three. In 
some applications, e.g. to abelian varieties, the height function 
becomes related to the group law in certain ways, and it is con- 
venient to make an abstraction of this situation. Rather than deal 
with the height function itself, we prefer to axiomatize the situa- 
tion for its logarithm. The results proved below are purely com- 
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binatorial. 

Let © be an abelian group. Let d be a real number >0. Let 
7:I’—R be a real-valued function on 7’. We shall say that 7 is 
of guasi-degree d if y(P)=0 for all Pel and if it satisfies the 
following condition: 

Given an integer m>0, and ereal>O, and an element acT, 
there exists a number c > 0 such that for all PET we have 


—c + (m* — e)7y(P) S pmP+4+a)Sc+(m'*+e)y(P). 


We shall say that 7 is non-degenerate if given 8 > 0 there exists only 
a finite number of elements Pel such that 7(P) < P. 

The arguments used to prove the next proposition are known as 
the infinite descent. 


Proposition 7. Let I’ be an abelian group and y a real valued 
function on T’ of quasi-degree d>0. Let m be an integer >1 such 
that T'/mT is finite, and let a;,:++,as, be representatives in T' of 
T'/mI’. There exists a number c, and a subset © of I’ such that: 

Gh) 7) = 6c. for all Pes. 

(ii) For any P,€T, there exist integers no,m,,-++,N, and a point 
P in © such that 


Py = MP + m0; + ++ ° + Gs - 


Proof: In the definition of quasi-degree, the number c depends 
on m and a, so let us write c(m,a) for it. In the present applica- 
tion, we let c = sup; c(m, a;). 

Consider a sequence (Po, P,,---) of points of I constructed by 
starting with our point P), and such that 


i by — Gy, 
By definition, there is a number 2>1 (in fact, 2 = m*—e) such 
that 
(Pryi1) S y(Prv)/A+ c/a. 
Proceeding inductively, we get 
n(Py) S 1/2” + o(Po) + cA + 1/4 + 1/2? + +++) 
and from this our assertion is obvious. 


Corotuary. Let I be an abelian group and 7 a real-valued function 
on T’ of quasi-degree d > 0, and non-degenerate. Let m be an integer 
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>1 such that /mT is finite. Then T' is finitely generated. 
Proof: Immediate consequence of the definition of non-degeneracy. 
The procedure in the proof of Proposition 7 can be reversed and 
leads to an infinite-ascent, which we use to estimate the number 
of elements Pel’ such that 7(P) = 8 as 8 becomes large. 


Tueorem 3. Let I' be a finitely generated abelian group, of rank 
r (maximum number of linearly independent elements over Z). Let 
7 be a real-valued function on I’, which is of quasi-degree d > 1 and 
non-degenerate. Given ¢ > 0 there exist two numbers ci(e), c(e) > 0 
and a number 6 with 0 < 6 <« such that for all B > B(e), the num- 
ber N(8) of elements of I" such that y(P) S 8 satisfies the inequalities 


c,(e)p7/*-* < MB) eS co(e)p/2*? 


Proof: We can write as a direct sum of a free abelian group 
and a finite torsion group. From the definitions, and the approxi- 
mate nature of our estimates, we may assume without loss of 
generality that I’ is free over Z. 

Let m be an integer =2 and ~,---x, a basis for / over Z. Let 
{a;} range over those elements of / which can be written in the 
form 


NX, rte + NX 


--+, Py) of distinct points of /’ such that 


with —(m—1) Sn; S(m-—1). By oy we denote a sequence (Pp, Pi, 


Po = Gj, 
P, = mP, + 4, 


FO — ea 


Taking into account the m-adic expansion of an integer, we see 
that the number of elements of /’ which can occur in some sequence 
o, is precisely (2m” — 1)" because their coefficients in Z, in terms 
of our basis, range from —(m” — 1) to +(m” — 1). 

Since 7 is non-degenerate, given e >0 there exists an integer 
¥o = vo(m, €) such that for all » =») and any point P, in a sequence 
oy we have 


(m* — e)y(Py-1) S 0 Py) S (m? + e)9(Py-1) . 


Inductively, we get 
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i) (m* — e)*°y( Py.) S (Ps) S (m* + &)y(P,,) . 


Put co(m, e) = log y(Py,)/log(m* + e). 
Ibe 5) 22 py eyavel she 


log B 


aa A= log(m* + «) 


ae = Colm, é) 
then by an obvious logarithm, we see from the right inequality of 
(1) that 


Wy) = Bas 


Consequently, for such v, we get a lower bound for M8), namely 
(2m” — 1)” which we replace for convenience by m’’/2”. We see 
that for all 8 = B.(m,e) we get 


c:(m, eB" = N(B) 


where 4 = log m/log(m’ + «). 
On the other hand, if » =») and 


log 8 


eG a Com, &) + Yo 


(3) 
with an obvious number ci(m, «), then by the left inequality of (1) 
we see that 


Cea) Be 


Consequently, all the P with 7(P) S 8 must lie in sequences o, for 
vy satisfying 
log 8 


, 
ee ay com, €) + vo. 


(4) v 


iA 


This gives us an upper bound for M{&) using 2’m’" as an upper 
bound for the number of elements in such sequences, namely, 


N(8) S ex(m, e) B”™ 


where 4’ = log m/log(m* — «). Since m, ¢ where fixed at the begin- 
ning of our proof, we can now take m large and « small, so that 
the assertion in our theorem is clear. 


Historical Note: 


The necessity of counting points rational over a number field 
led Weil to his decomposition theorem [41]. Siegel, in studying 
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integral points, then defined the height. An analysis of all the 
proofs in this subject shows that the crucial step is always the 
study of sets of bounded height, as well in the Mordell-Weil theo- 
rem as in the Siegel theorem, or the Roth theorem. We have given 
heights here under a maximally axiomatized situation inspired by 
Weil’s treatment of heights in number fields [40]. 

With a view to applying this theory to the theorem of the base, 
Neron [24], [25] studied heights in function fields, where it was 
again taken up in [19]. The geometric interpretations were already 
given in [25], especially Theorem 2. 

Originally, the height was only considered relatively. It was 
Northcott [26] who observed that by taking a suitable root, one 
could get it uniformly for algebraic extensions as well, and could 
extend to these the theorem, in number fields, that points of bounded 
degree and bounded height form a finite set. This is not used in 
our applications, however, since we are always concerned with 
points rational over the given ground field. 

It is interesting to try to give an estimate for the number of elem- 
ents of a number field K of height <B for B>o, Letr=7,+7,—-1 
(where 7, is the number of real absolute values and 7, the number 
of complex ones). If one asks for the number of integers or units of 
height <B, then one is again reduced to computing the number of 
lattice points in a region. For the integers, the volume of this 
region is of type 


Bico + ci(log B) + -++ + c,(log B)’) 


where c; are constants >0 (combinations of powers of 2 and z). 
As Cassels pointed out to me, one should then use Davenport’s lemma 
[7] to compute the number of lattice points in this region. One 
obtains then the asymptotic estimate cx, B(log B)" for the number 
of integers of height <B, and similarly, an estimate cx (log B)" 
for the units of height < B. 

Over the rationals, there is of course no problem for the integers 
and units, and it is a standard result of elementary number theory 
that the number of relatively prime pairs of integers of absolute 
value <B satisfies an estimate of the type cgB’ with an error term 
O(B log B). 

Gauss’ lemma is included here partly for its application to Roth’s 
theorem, where it allows us to start the induction trivially. It is 
a very interesting question to try, in higher dimensions, to givea 
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relation between the absolute values of the coefficients of a poly- 
nomial and the height of the rational points on the variety de- 
termined by it. 

Proposition 7 is the usual infinite descent of the Mordell-Weil 
theorem properly axiomatized (cf. the historical note of Chapter V). 
Similarly, Theorem 3 and its proof are taken from Néron’s thesis 
[24]. 


CHAPTER IV 
Properties of Heights 


Throughout this chapter, F is a field with a proper set of absolute 
values My satisfying the product formula. We denote by K some 
finite extension of F, so that the set Mx satisfies the product formula 
with multiplicities N,,vé Mr. 

If V is a variety defined over K, then for each morphism of V 
into a projective space also defined over K, we can define the ab- 
solute height of a point of V algebraic over K, and its relative 
height if it is rational over K. 

We shall assume that V is complete and normal. Maps into pro- 
jective spaces are then given by linear systems, and it will be our 
main purpose to give conditions under which the corresponding 
height functions are equivalent or quasi-equivalent (for the defini- 
tion, see §1). We shall formulate our theorems for the absolute 
height #, and it will then be obvious that they remain valid for 
the relative height Hx which is a fixed power of the absolute one 
(namely the [A: F]-th power). In the applications, we need only the 
equivalence criteria for the relative height. 


§1. Functoria!l properties 


Let V be a variety. Two real-valued functions 4 and 24’ on a set 
of points of V will be said to be equivalent if there exist two num- 
bers c:, C2 > 0 such that for all points P in our set, we have 


CAPS AP) = cd(P) . 


We then write 4~ 2’. We define 4 and 2’ to be quasi-equivalent 
(and write 4 = 4’) if given e > 0 there exist two numbers (, c, > 0 
depending on e, such that for all points P in our set we have 


onlP)" SP) aay . 


These relations are obviously equivalence relations (symmetric, re- 


flexive, transitive). 
Let F be a field with a proper set of absolute values Mr satis- 
fying the product formula. Then for each finite extension K of F, Mx 
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satisfies the product formula with multiplicities N,. Let V be a 
variety defined over K. Let g: VP” be a morphism of V into 
projective space, defined over K. Then for each point P of Vz, 
y(P) is a point of P”, rational over K, and we can thus define its 
height which we shall denote by Hx,o(P). If P is algebraic over 
F, then there exists a finite extension K of F over which it is ra- 
tional, and we can then define its absolute height 
hf P) = h(g(P)). 
Thus Hx,, is a function on Vx while dy is a function on Vz = 
Vz, i.e. on the set of points on V rational over the algebraic 


closure of K. 
Let V be projective, defined over K. Suppose its points are rep- 


resented by homogeneous coordinates (%,-°°+,%n). Let (a:;)@ =0, 
---,m and j =0,---,#) bea matrix with coefficients in AK, and put 
Yi = AinXo + +++ + GinXn. 


Then the map (x) >(¥) defines a rational map g: V-P”. If P is 
a point with coordinates (x) such that not all y; are equal to 0 in 
the above formula, then ¢ is defined at P. A map ¢ obtained in 
the manner just described is called a linear projection, defined over 
1G 


Proposition 1. Let V be a projective variety defined over K. Let 
yg: V-P” be a linear projection also defined over K. There 
exists a number c > 0 depending only on ¢, such that if P is a point 
of V, algebraic over K, such that not all the coordinates y; above are 
0, then 

he(P) < ch(P) 


(h being the height in the given projective embedding of V). 

Proof: Let Sg be the subset of Mg containing all those absolute 
values v for which some | a@;; |, is not 1 and all archimedean abso- 
lute values. Then Sx is a finite set. If our coordinates are in a 
finite extension E of K, then for any w¢ M; extending some ve Mg, 
we have 


sup: | ys lw S Cy Sup; | %3 Ju 
where c, = (m+ 1)sup,;[| ai; |, 1] for all v¢e Sx, and cy=1 for all 
v€Sx. From this we get 
hg P) = ch(P) 
where 
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1/(#:F] 
eth I of» | Sle 


vEM RE wiv 


From our proposition, we see in particular that if V is again a 
complete variety, and if g, 9’ are two morphisms into P” which 
differ by a projective transformation defined over a finite extension 
K of F, then hg is equivalent to hy. In Property 3, we shall prove 
a much stronger criterion for equivalence. 

Let us assume now that V is complete and normal. 

We shall obtain mappings g: V-— P” by means of linear systems 
(cf. IAG, Chapter VI, or Foundations, Chapter IX). Let & be a 
linear system on V defined over K. This means that we can find 
a divisor X),»¢€ _& rational over K, such that the space of functions 
L, consisting of all functions on V whose divisors are of type 
X — X, as X ranges over & (together with 0) has a basis defined 
over K. If (fo =1,---,/m) is such a basis, then it defines a rational 
map ¢:V-P”. If & is without fixed points, then ¢ is a mor- 
phism, and conversely if ¢g is a morphism, the linear system without 
fixed component belonging to it is without fixed point. In view of 
the remark in the preceding paragraph, we see that the equivalence 
class of hy actually depends only on the linear system. 

Instead of taking a basis as above, let us take any set of gene- 


rators of the vector space Ly, defined over K, say 90,°*+,9n. Then 
each g; is a linear combination of the f; with coefficients in AK, and 
conversely. Our set (90, --+, Yn) determines a rational map 

gy’: VP", 


said to be derived from &, and it is clear that gy’ is a morphism 
if and only if g is a morphism. If this is the case, then we see 
again that hy~ he. 

Let -@ be another linear system on V, also defined over K. Then 
we can define the sum -&+-.24 asa linear system. If (fo = 1,---, fm) 
is a basis for L, over K, and (go = 1, °::,9n) isa basis for M, over 
K obtained in a similar manner, then we get a vector space of 
functions N, generated by the products fig;. (These fig; need not 
be linearly independent over K, however.) We have fog =1. The 
divisor of f:g; is 

(figs) = (Fi) + G5) - 
If we write (f;:) = X; — Xo and (g;) = Y; — Yo with Xi, X,¢& and 
Y;, Y.¢-A4 then 
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(figs) = Xi + Y3 — (Xo + Yo). 


The vector space Ny gives rise to a linear system -” called the 
sum of & and & If H and-4 are both without fixed points, 
so is their sum. 

Let ¢: V-P”™ and go’: V-P” be the two rational maps derived 
from & and -# above. Then the functions {fig;} give rise to 
a rational map ~ of V into P[™*? "*-!_ If we denote by ¢+¢’ 
any one of the rational maps derived from &”+-/4 then obviously 


hy = hohe = and hy a ho+@ . 
Summarizing, we get: 


Property 1. Let V be acomplete, abstract, normal variety defined 
over K. Let &,-4 be two linear systems on V, also defined over 
K, and without fixed points. Let ¢,¢' be two rational maps of V 
into P™,P” respectively, derived from these systems over K and let 
y+’ be a rational map derived from & + & Then 


hore ~ hehe . 
The next property also follows immediately from the definitions. 


Property 2. Let U, V be two complete abstract varieties defined 
over K. Assume that U is normal, and V non-singular. Let ow. U->V 
be a morphism, surjective and defined over K. Let & be a linear 
system on V defined over K, and let wo (.&) be the linear system 
on U consisting of all divisors w (X)as X ranges over &. Let ¢ 
be a rational map of V into P” derived from & over K. Then gow 
is a rational map derived from the linear system w'(&). Assume 
in addition that & is without fixed points. Then so is w '(&) and 
we have 


Noow = hyow a 
Proof: We have assumed that V is non-singular in order to ensure 
that the inverse image of a divisor linearly equivalent to 0 is also 


linearly equivalent to 0. That wo (&) is a linear system is obvi- 
ous, and so is the rest of our assertion. 


§2. Criterion for equivalence 


The next property is the most crucial one. 


Property 3. Let V be acombplete, abstract, normal variety defined 
over K. Let &, # be two linear systems on V, also defined over 
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K, and without fixed points. Let ¢, be two rational maps of V 
into P”, P” respectively, derived from these systems. If the divisors 
of & and are linearly equivalent to each other, then hg and hy 
are equivalent. 

Proof: We shall first need alemma. In the course of its proof, 
we shall use the fact that if a function f¢ K(V) is not defined at 
a point @ of V, then it has a pole passing through Q (IAG, Propo- 
sition 3 of Chapter VI, § 1). 

As usual, we let o be greater than any real number. Also, the 
product formula is irrelevant for the lemma, so we restate the 
hypotheses completely. 


Lemma 1. Let K be a field with a proper set of absolute values 
Mr. Let V be acomplete normal variety defined over K. Let f,, ++, 
Sm be functions in K(V) whose divisors of zeros have no point in 
common. Then there exists an Mx-divisor c such that for all K- 
valued places x of K(V) over K and ail we Mz we have 


SUPe | 2 fa) lw 2 clw) . 


Proof: In the ring K[/i, ---,fm] consider the ideal (fi, ---, fm). 
It must be the unit ideal, for otherwise there would exist a homo- 
morphism of our ring over K mapping all the f, on 0, and this 
homomorphism can be extended to a place. Following it up by 
another place if necessary we may assume that this place is K- 
valued and that it has been extended to K(V). Some conjugate 
place will then be the identity on A, and this contradicts our as- 
sumption. Hence there exists an expression of the type 


LS Seen, Pn) 


where the z lie in K, and the M, are monomials with coefficients 
1 and of degree =1. Consider the finite set Sx consisting of all 
archimedean absolute values and all those v where some | 2a |, #1. 
We can take c(v) = 1 for v€Sx, and for v¢ Sz, whenever all the 
(fw) are finite, the relation 


1 = Samah), +++, (fm) 
shows that the | z(fa) |. cannot all lie below an obvious lower bound, 


determined by the absolute values of the z,. This proves our lem- 


ma. 
Let X, be a divisor of &, rational over K, so that the derived 


space of functions L, has a basis (fo, --:, fm) defined over AK. For 
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each f;, we can write 
(fi) = Xi — Xo KGeEF . 


We denote by ¢; the rational map of V into the affine K-open sub- 
set of P” determined by the functions (fo/fi, ---,fm/fi). We let Vi 
be the K-open subset V—Supp(X;) of V. Then the V; cover V, and 
vy; is defined at every point of Vj. 

We proceed similarly with 4 In view of our assumption, we 
may take the divisor Y, = X, to get the derived space of functions 
M, with a basis (g, ---,9n) defined over K. Then 


(93) = ¥o — Xe Y;e.4, 


and the (9, -:+,9n) determine our map ¢: V— P”. 
We apply our lemma to the functions fo/g, ---,fm/g where g is 
any one of the g;. Then 


(filg) = Xi — Y 
with Y > 0 and the hypothesis of the lemma is satisfied. For any 
point P € supp(X,), such that 9(P) +0, we get 
sup: | fiCP)/9(P) |v 2 cv) 
for any wé Mg extending ve Mx, and hence 


sup: | fi(P) lw 2 cv) | 9(P) |v. 


This last relation remains valid whether g(P) is 0 or not (always 
provided P ¢ supp(X,)). Since g was selected arbitrarily among the 
9;, we get for any P € supp(X,) 


sup; | fi(P) |w 2 c(v) sup; | 9i(P) lw . 
If P is rational over the finite extension E of K, then w may be 
viewed as being in Mr. We raise this inequality to the power 
Nw = [E,: F.], take the product over all w¢ Mz, and take the 
1/{E:F]-th root. In view of the facts that c(v) =1 for v not lying 
in some fixed finite subset Sz of Mg, and that our absolute values 
are proper, we get 
Cihy(P) S ho P) 


where c, is the fixed number [].es,c(v) which does not depend on 
the field E, and thus holds for all points P algebraic over K, 
P € supp(X). 


Our constant c, of course depends on V,, and we may write c,= 
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c:(V.). Since V is covered by Vo,+--, Vn, we can now take for 
c, the smallest of the c,(V;), so that our inequality above holds for 
all Pin V algebraic over K. 

The inequality on the right follows by symmetry. 


§3. Criterion for quasi-equivalence 


From Property 3 and lemmas of pure algebraic geometry, we 
shall prove that if the divisors of the linear system are algebrai- 
cally equivalent, then the associated height functions are quasi- 
equivalent. 


Lemma 2, Let V be a non-singular projective variety, and X a 
hyperplane section. Given an integer d, there exists a positive integer e 
depending only on V and d such that for any positive divisor Y on 
V of degree d, the divisors Y + eX and —Y + eX are ample. 

Proof: The assertion concerning —Y + eX is proved in F’-IX, 
Theorem 13. The other assertion is an immediate consequence of 
this one: namely, we first find e, > 0 such that —Y + e,X is ample. 
It is then linearly equivalent to a positive divisor 7, and by cutt- 
ing with generic linear varieties of suitable dimension, we see that 
the degree of J depends only on that of Y, and e,. Repeating our 
application of Theorem 13, we find an integer e, >0 such that 
—T+e,X is ample, and e, then depends only on d. This proves 
our lemma. 


Lemma 3. Let V be a complete non-singular variety. Let X be 
a divisor on V such that some positive multiple of X is ample. Then 
there exists an integer e >0 such that for any divisor Z on V algeb- 
raically equivalent to 0, the divisor Z + eX is ample. 

Proof: Let A be the Picard variety of V. We can always find 
a Poincaré divisor D on V x A which is positive. If V is an abe- 
lian variety, this is AV, Theorem 10 of Chapter IV, § 4, and 
otherwise, making a generic translation on D, the pull-back method 
gives a positive Poincaré divisor on V x A (AV, Theorem 1 of 
Chapter VI, §1). . 

By hypothesis, Z~ ‘D(a) —‘D(0) for some point a@¢ A. The in- 
tersections are defined after making a generic translation on D. By 
Lemma 2, there exists an integer e; > 0 such that —'*D(0) + aX is 
ample. Furthermore, the divisors ‘D(a) as a ranges over A are 
all algebraically equivalent to each other, are positive divisors, and 
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have the same projective degree. Hence there exists an integer e, >0 
such that ‘D(a) + e.X is ample, again by Lemma 2. From this our 
lemma is immediate. 

The following notation will be convenient. If Y is a divisor on 
V defined over K, such that its complete linear system is without 
fixed points, then we shall denote by hy the height function hy as- 
sociated with any one of the maps ¢ derived from this linear sys- 
tem. Thus hy is well defined, up to equivalence. (Similarly, for 
points in Vx, we shall write Hy for Ay.) 


Propverty 4. Let V be a complete non-singular variety defined over 
K. Let X,Y >0 be two positive divisors on V, rational over K. 
Assume that a positive multiple of each is ample, and that the linear 
systems &(X) and L(Y) are without fixed points. If X and Y 
are algebraically equivalent, then hx and hy are quasi-equivalent. 

Proof: Using Property 3, we shall reduce our assertion to a 
statement concerning linear equivalence of divisors on V. By 
Lemma 3, there exists an integer e > 0 such that for all 2 > 0 we 


have 
mX—-Y)+ex~ Z, 


where Z, is a positive divisor on V, and &(Z,) is without fixed 
points. Since m(X — Y) + eX is rational over K, one may take Z, 
rational over K. We get nX +eX~nY-+ Z, and taking heights, 
this yields in view of Property 1, 


n+ 
hx o~ hzhz, * 


Since the height is always = 1, we take an mth root of our relation 
for m large. We see that given e > 0 there exists a number c>0 
such that 


chx(P)'** = hy(P) 


for all points P on V, algebraic over K. The other inequality is 
obtained in a similar way, or by symmetry. 

An important case is that in which V is a curve. In that case, 
algebraic equivalence is determined by the degree, and hence we 
get: 


Corotuary. Let V be a curve, i.e. of dimension 1 in the preced- 
ing property. If deg(X)=d and deg(Y)=d', then hx is quasi- 
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One can give a direct proof using the Riemann-Roch theorem 
formally instead of the Picard variety, as in [37]. Thus when V 
is of dimension 1, very little machinery is needed to establish Prop- 
erties 3 and 4. 


§4. Projections on curves 


In the application of the theory of heights in Chapter VII we 
shall deal with curves, and it is therefore convenient to investigate 
more closely how the height behaves under projection. 

Let U be a projective non-singular curve defined over K. The 
height / is taken with respect to this embedding. 


Proposition 2. Let (yo, --+, yn) with yo =1, be functions in K(U) 
determining our embedding in P". Let © be a fintte set of points of 
U in. K. Then there is a polynomial equation such that if (do, -*-, 


Qn, bo, +++, bn) are elements of K not satisfying this equation, then the 
Function 
pee Giaeck rene + GnVn 


has the following properties: 

(i) The function y is not constant, and has no zero or pole 
among the points of ®. 

(ii) If h, is the height determined by the mapping of U into P* 
arising from the function y, then h,~h. a 

(iii) The mapping Q — y(Q) gives an injection of © into K. 

Proof; Let Q be a point of U in K. If t is a function of order 
1 at Q, then each y; has an expansion as a power series in ¢, say 


yi = Et +--- with an integer e;, which may be negative, and 
£;¢K. We see that there is a polynomial Ge (linear, in fact) such 
that for any set of elements (q,-+-+-,a@,) in K for which Ge(a) +0, 


then ayo + +++ + @nyn has order e at QM, where e = inf,e;. Taking 
Q from a finite set ®, we then take the product of the Gg for QVE® 
and achieve the same thing for all Qe9. 

If a denotes the sup of the polar divisors of our given y;, then 
we see that almost all linear combinations ayo + -+- + @nyn, have 
precisely a as polar divisor. Furthermore, applying the above re- 
marks to zeros instead of poles, and taking into account that the 
linear system determined by (1, 1, °++, Jn) is without a fixed point, 
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we see that we can make a sufficiently general choice of a; and b; 
such that the function y has no zero or pole in 9, and its divisor 


(¥) = (Vo — (9) a0 


is such that (y)» lies in the above linear system. According to 
Property 3 of heights, it follows that h, is equivalent to h. 

To insure that the map Q— y(Q) is injective on 9, we select 
among the y,; (for each Q) one of the functions having the highest 
order pole at Q, and denote it by yg. All quotients y./ye are de- 
fined at Q, and we have 


— aatoo(Q) + +--+ at0e(@) 
WQ) = bow(Q) +--+ + b,w.(Q) 


where wi = yi/yq. (Strictly speaking, each w; should carry Q also 
as an index.) We can choose the 0; so that the denominator does 
not vanish, and the condition that y(Q) + y(Q’) when Q + Q’ are 
two distinct points of @ is immediately seen to be implied by the 
nonvanishing of a polynomial in the @; and b;. This concludes 
the proof of our three statements. 


Historical Note: 


For some time, the properties of heights were mixed up with 
the properties centering around the decomposition theorem. These 
originated mainly with Weil, to whom Property 3 is due. Proper- 
ties 1 and 2 are of course trivial and are stated explicitly only to 
have a ready formalism available. We have taken our proof of 
Property 3 from [40]. 

Property 4 in the case of curves was essentially discovered by 
Siegel [37], as it was needed for the study of integral points. It 
was generalized to arbitrary dimensions by Lang [18] (for the same 
purpose). It can be used efficiently in the infinite descent. 

The property of projections has been abstracted from Siegel’s 
and Mahler’s papers, always motivated by its application to integ- 
ral points [18], [21]. It will not be used until Chapter VII. 


CHAPTER V 


The Mordell-Weil Theorem 


We consider abelian varieties, defined over essentially global fields, 
namely, those of Examples 1, 2, and 3 of Chapter II, §1. We shall 
prove an absolute result and a relative one concerning the group 
of rational points of an abelian variety over such fields, namely: 


TuHeorREM 1. Let K be a finitely generated field over the prime field. 
Let A be an abelian variety defined over K. Then Ax is finitely 
generated. 

To state the second theorem, we recall a definition. Let K be 
a function field over the constant field & Let A be an abelian 
variety defined over K. A K/k-trace of A is a pair (B,r) consisting 
of an abelian variety B defined over k and a homomorphism 


tc: BoA 


defined over K having the following universal mapping property: 
Given an abelian variety C defined over k and a homomorphism 
yg: C— A defined over K, then there exists a unique homomorphism 
¢~x: C— B defined over & such that the diagram 


, 
B 
is commutative. For the existence of the K/k-trace see AV [17], 
Theorem 8 of Chapter VIII, §3. We also recall that if E is an 
extension of & which is free from K over k, then (B,r) is also a 
KE/E-trace: It does not change under constant field extensions, 
because K is regular over k by assumption. Furthermore, the 
homomorphism rt is injective. 

We can now state our relative result. 

Tueorem 2. Let K be a function field over the constant field k, 
and let A be an abelian variety defined over K. Let (B,t) be a 
K/k-trace of A. Then Ax/cB, is finitely generated. 

For instance, if A is an elliptic curve with transcendental j-in- 
variant over k, and K = k(j), then B = Oand Ax is finitely generated. 


( 
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We prove the theorem in two steps. First the so-called weak 
Mordell-Weil theorem, where we prove that for an integer m2 1 
prime to the characteristic, Ax/mAx is finite under the following 
additional assumptions: 

The group of points A, of period m is contained in Ax. 

In the function field case, the constant field is algebraically closed, 
and the dimension of K over k is equal to 1. 

Indeed, we prove our result by relating our factor group Ax/mAx 
to the Galois group of certain unramified extensions of K. This 
is the Kummer theory, which we reproduce in §2. Using the fact 
that unit groups and ideal class groups are of finite type, we prove 
that certain unramified extensions are finite, and then that A«c/mAx 
is finite. In the function field case, these arguments certainly 
apply in dimension 1, and reduction steps proved in §1 show that 
we can limit ourselves to this case in order to prove Theorem 2. 
We shall also prove in §1 that we may assume that A,» Cc Ax, and 
that k is algebraically closed. 

Once this is carried out, we apply Property 4 of heights to a- 
belian varieties and see that the logarithm of the height function 
is of quasi-degree 2 so that we can apply Proposition 7 of Chapter 
IV, §4 and its corollary. In number fields, we are then finished. 
In function fields, we must make an analysis of sets of points of 
bounded height. We shall find out that they lie in a finite number 
of cosets of tB,. This will thus prove Theorem 2. 

We then return to Theorem 1 for finitely generated fields over 
the prime field, viewing the algebraic closure & of the prime field 
in K as a constant field. Applying Theorem 2, the known result 
in number fields in characteristic 0, and the trivial fact that B, 
is finite if & is finite, we conclude the proof of Theorem 1. 

We can then apply Theorems 1 and 2 to get results concerning 
divisor classes. 


§1. Reduction steps 


The propositions which we prove in this setion will be used to 
reduce our main theorem in function fields to special cases. Through- 
out this section, K will denote a finitely generated regular extension 
of a field &. 

We first show that to prove our main theorem we may extend 
our function field by a finite separable extension. 
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Proposition 1. Let L> K be a finite separable extension of K, 
also regular over k. Let A be an abelian variety defined over K. 
Let (A*", cx) be its K/k-trace, and (A™",r,) its L/k-trace. Then the 
factor group 


[cA N Aglitx(A*"), 
is finite. 
Proof: Let T; be the graph of tz. It is defined over L. Let 
us take the intersection 


H= nr 


of I’; and its conjugates over K, where o ranges over the distinct 
isomorphisms of L over k. Then H is K-closed, and is an algebraic 
subgroup of A”/* x A. Its connected component H, it therefore 
defined over K by Chow’s theorem (AV Theorem 5, Chapter 
eS 2): 

We have 


Pr, 9 (A) X Ag] = Hn [(A™); x Ag] 


and this group contains Hy 9 {(A’/")* x Ag] as a subgroup of finite 
index since Hy is of finite index in H. 
We have a surjective homomorphism 


Tr, 9 [((A™), x Ag] tA"), 9 Ax 


by projection on the second factor. We contend that the inverse 
image of tx(A*/"), contains Hy n [(A”* x Ax]. From this it will 
be clear that the desired factor group is finite. 

Since H is contained in the graph of a homomorphism of A’”/* 
into A, it is itself the graph of a homomorphism § of its projec- 
tion B on the first factor into A. Furthermore, B is contained in 
A’/* and is defined over K, and hence over k by Chow’s theorem. 
By the universal mapping property of the K/k-trace there exists a 
commutative diagram 


B 
18) Se re el 
Ae 


with $, defined over k, hence §(B;) is contained in tr(A*/),. This 
proves our contention and concludes the proof of our proposition. 
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Corotuary. Jf Theorem 2 is true for Lik, then it is true for K/k, 
ie. if Azlt,(A™, is finitely generated, so is Axgitx(A*/"). 

Proof: This follows immediately from the proposition and the 
fact that we have an injection 


0 Axi[t(A””) 9 Ax] > Arlt A” . 
Next we show that we may extend the constant field. 


Proposition 2. Let k’ be any extension of k which is independent 
of K over k, and let K'= Kk'. Let A be an abelian variety defined 
over K and let (B,r) be its K/k-trace. Then 


Abe i) clipe = Ge - 


Proof: We know that (B,r) is also a K'/k’-trace of A. The 
inclusion tB, C Ag N ty is obvious. Conversely, let 0 be a point 
of B, such that rb is rational over K. If we knew that rt is regu- 
lar, i.e. that it is birational between B and its image in A, then 
in view of the fact that ct is defined over K, we would see im- 
mediately that r ‘rb is rational over k. As we do not know that 
t is regular, we use Chow’s regularity theorem of AV Chapter 
9. Wecan write K =k(u) for suitable parameters u, and A= Anz, 
t=t,. Let m,-+++,u%m be independent generic specializations of u 
over k’. For m large, the map 


> (Tagen > » 5 agian) 
of B into A,, x Au, X ++: X Au, is regular, and thus birational 
between B and its image. If rd is rational over K, then (ceo, 9 
Tu,D) is rational over k(u,,°++-,%,,); hence, b is rational over 
R(t, +++, Um). 
Since we took k(u,, +--+, Um) free from k’ over k, and since 0 is rational 
over k’, it follows that 0 is rational over k. This concludes the 


proof. 
Finally, we give the reduction to the case where K is of dimen- 
sion 1 over k. 


Proposition 3. Let K> E>k be a tower of fields such that K is 
regular over E and E regular over k. If Theorem 2 is true for 
K/E and E/k, then it is true for K/k. 

Proof: Let A be an abelian variety defined over K, and let 
(A*/* cen) be a K/E-trace of A. Let (A?/",r9,) be an E/k-trace 
of A*/”. By the universal property, there is an injective homo- 
morphism §: A*/"-—> A*/* defined over k such that the following 
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diagram is commutative: 


TBI/k | B 


Arlt 
All the homomorphisms are injective. We have 
Ag Dtgj(A*)g > tKxsatanlAn), = te nB(An . 


If we assume that (A*/”), is of finite type modulo ry/,(A*”),, it 
follows that Ag modulo rr/,8(A®”™), is also of finite type. But this 
last group is contained in tx;,(A*/"),. We see, therefore, that it 
suffices to prove Theorem 2 for each step K/E and E/k of the 
tower. 

It is well known and easy to show that one can always construct 
a tower Ey =k CE, c--- C E,=K such that each step E;/E;-; is 
regular and of transcendence degree 1, Such a construction comes 
from the lemma of Bertini’s theorem, which states that if x,y are 
two elements of K, algebraically independent over k, and such that 
y € K*k, then for all but a finite number of contants c € k, K is 
regular over k(x + cy). (Cf. IAG, Chapter VI.) If & is finite, one 
needs a small additional argument. However, in view of Proposi- 
tion 2, for our purposes, we may assume that & is infinite. 

The above reduction steps have thus reduced the proof of Theorem 
2 to the case where Ax contains Am, k is algebraically closed, and K 
is of transcendence degree 1 over k. 


§2. Kummer theory 


Throughout this section, we let K be a field and m an integer 
> 1, prime to the characteristic of K. 

An abelian extension E of K will be said to be of exponent m 
| if o” =1 for every automorphism o of E over K. 

Let A be a commutative group variety defined over K. As usual, 
| we denote its group of rational points in K by Ag. We let An 
| be the subgroup of those points a@¢ A such that ma = 0 (writing 

A additively). 
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We recall that the map x— mx is surjective and separable (so 
that in particular, A» is finite, of order equal to v(md)). Quick proof: 
If f,, ---,f, are local paramenters at the origin, then 


fi(mP) = m- f{P) mod mW’ 


where m is the maximal ideal of the local ring at the origin 
(IAG, Proposition 1 of Chapter IX, §1). Thus md is generically 
surjective and separable. If a is a given point, and x a generic 
point over K(a), then a — x is generic over K. By an m-th root of 
a we shall mean a point 6 such that mb=a. Any two such 
m-th roots differ by a point of An. If my =x and mz=a— x then 
my +z)=a. Since K(x, y) and K(x, z) are separable over K(x) 
and over K, it follows that K(y + z) is also separable over K. 

If a€¢ Ag, we denote by K(1/m-a) the field obtained by taking the 
composite of all fields K(b) where 6 ranges over all m-th roots of 
a. Tben all points of A,» are rational over that field, which is 
separable over K by what we have just proved. If A, Cc Ax, then 
this field is equal to K(b) where 6 is any one of the m-th roots of a. 

From now on, we assume that A, is contained in Ag. Let B 
be a subgroup of Ax containing mAxg (i.e. the set of all points 
ma with a€ Ax). We denote by 


Ey = Kame) 
the field obtained by taking the composite of all fields K(1/m-a) 
where a ranges over B. It is determined by B, in a given alge- 
braic closure of K, as usual. 

If o is any isomorphism of E, over K, then with the above nota- 
tion and with ma =a, we have 
moa) = o(ma)= ma. 

Consequently sa —a is an element of An, say te. Thus 

oa=art+te. 


Hence o is in fact an automorphism of E, over K, and Ey is 
therefore Galois. If tr is another automorphism of E, over K, and 
ta =a+t, then clearly cra =a+t,+t, and hence E, is abelian. 

Let G be its Galois group. We define a bilinear map 


Gx B-A, 


as follows: Let o€G and be B. Take any BEA such that mB = 
b. Then o8 — 8 is clearly independent of our choice of 8, and is 
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an element of An, which we denote by (0,5). The reader will 
immediately verify that this is bilinear in o and 8. 

By definition, the kernel on the left consists of those o such that 
(o,b)=1 for all b€ B. As Ex is generated by elements of type 
1/m-b any such o must be the identity. 

The kernel on the right consists of those 6 such that (¢,b)=1 
for all o€G. For any such 3, the extension K(1/m-b) is fixed under 
G and hence must be K itself. This means that 1/m-b is in fact 
rational over K, i.e. that b lies in mAg. We thus get a bilinear 
map 

G xX BlmAxg— An 


whose kernel on both sides is equal to the identity. Since A,» is 
finite, we obtain 


Proposition 4. Let A be a commutative group variety defined over 
the field K. Let m be an integer = 1, prime to the characteristic. 
Let An < Ax, and let B be a subgroup of Ag containing mAx. Then 
K(1/m-B) is an abelian extension of exponent m and is a finite ex- 
tension if and only if BlmAx is a finite group. 

Let us consider in greater detail the special case where A is the 
multiplicative group. We then write the group law multiplicatively. 
Our group A, is K%, the group of m-th roots of unity, and is cyclic 
of order m. Thus the two groups G and B/K*™ are dual to each 
other under our pairing above, and of the same order if they are 
finite. (We shall not need the infinite case in the sequel.) 

If B,, B, are two subgroups of K* containing K*” and if B, ¢ 
B, then K(Bi/") c K(Bi/”). Conversely, assume this is the case. 
We wish to prove B,c B,. Let b€B,. Then K(b/") c K(B”) 
and K(b'’”) is contained in a finitely generated subextension of 
K(By"). Thus we may assume without loss of generality that 
B/K*” is finitely generated. Let B; be the subgroup of K* gen- 
erated by B, and 6. Then A(B}/") = K(B;”) and from what we 
have see above, the degree of this field over K is precisely 

(oe: A) Onn __), 
Thus these two indices are equal, and B,= 83. Summarizing: 

Tueorem 3. Let K be a field, m an integer 21 prime to its 
characteristic. Assume that the m-th roots of unity lie in K. Then 
there is a bijective correspondence between abelian extensions E of 
K of exponent m and subgroups B of K* containing K*", given by 
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E= K(B‘"), and 
[E27 Kk) =(8 ea 


in the sense that if one index is finite, so is the other, and they are 
equal. 

Proof: We have proved everything except that every abelian 
extension of K of exponent m is of type A(B’”) for some B. But 
such an extension is a composite of finite cyclic extensions of ex- 
ponents m. Our assertion follows then from Galois theory (Hilbert’s 
theorem 90). 

We shall say that the abelian extension AK(B’’”) and B belong to 
each other. Theorem 3 is said to express the Kummer theory, 
and extensions of this type are called Kummer extensions. 

We wish now to determine when a Kummer extension is un- 
ramified for a discrete valuation. 


Proposition 5. Let v be a discrete absolute value on K, and aeé 
K*. Assume that m is prime to the characteristic of the residue 
class fiield of v. Then v is unramified in K(a’’”) if and only if 
ord,a is divisible by m. 

Proof: Assume first that the order of a is divisible by m. Then 
we can write a@=xzx""u where z is a prime element, 7 an integer, 
and «a unit. We see that K(a'/") = K(u’’") and that u'/” is a 
unit, satisfying X”—u=0. Since m is assumed to be prime to 
the characteristic, we can apply the criterion of Proposition 10 of 
Chapter I, §3 to conclude that K(a'/") is unramified at any ex- 
tension of v. Conversely, assume this the case. Then if a@=7*u 
with some integer s, it must be so that m divides s, otherwise, 
the value group of our extension K(a’/”) would obviously be unequal 
to the value group v(K*), and so the extension could not be un- 
ramified. 

Let K have a proper set of absolute values Mg and assume that 
all the non-archimedean absolute values of Mx are discrete. We 
shall say that an algebraic extension E of K is unramified over 
Mg, or that Mx is unramified in E, if every non-archimedean v € Mz 
is unramified in E. 

We continue to assume that the m-th roots of unity lie in K, 
and we let E bea Kummer extension of K, of exponent m, belong- 
ing to the subgroup B of K*. Using Proposition 2, we see that 
Mg is unramified in E if and only if, for each b€ B, there exists 
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an M,-ideal 6 such that 
(b) = mb. 
Hence: 


TuEorEM 4. Let K be a field with a proper set of absolute values 
Mx. Assume all the non-archimedean ones are discrete. Let m be 
an integer prime to the characteristics of the residue class fields of 
the discrete v€ Mx, and assume all m-th roots of unity le in K. 
Let B, be the subgroup of K* belonging to the maximal abelian 
extension of K, of exponent m, and unramified over Mg. Then B, 
consists of those elements b¢ K* for which there exists an Mx-ideal 
6b such that (b)=mb. If C is the group of Mx-ideal classes, Cn 
those of period m, and U the group of Mr-units, then we have an 
exact sequence 


0- U/U" > B,/K*" > C,- 0. 


Proof: Vf b€ Bu, and we have (6) = mb as above, then we map 
b on the ideal class of 6, which is obviously in C,. The map is 
surjective in view of Proposition 5, and K*” is obviously contained 
in the kernel. Thus we get the map of B,/K*”" onto Cy. The 
reader will immediately verify that its kernel consists of the units 
modulo. U%= Un Kk". 


Corouuary. If in the above theorem, both Cy and U |U™ are finite, 
then so is the maximal abelian extension of K, of exponent m, and 
unramified over Mr. 

One can often reduce the study of ramified extensions to that of 
unramified extensions, over a larger ground field, as follows: 


Lemma 1. Let K be a field, m an integer =1 prime to its charac- 
teristic, and E and abelian extension of exponent m. Let v be a 
discrete valuation on K, and x a prime element. Assume that the 
m-th roots of unity liein K and that m is prime to the characteristic 
of the residue class field of v. Let L = K(n'/"). Then every absolute 
value w of L extending v is unramified in EL. 

Proof: This is immediate from Proposition 5. 

We recall that an unramified extension stays unramified under 
translation (Proposition 11 of Chapter I, §3). Thus repeated appli- 
cation of Lemma 1 kills the ramification at any given finite number 
of absolute values of K, by means of a finite extension. We apply 
this to get the fundamental result: 
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Tueorem 5. Let K be a number field, or a function field in one 
variable over the algebraically closed constant field k. Let m be an 
integer =1, and prime to the characteristic. Let Mr be the canoni- 
cal set of absolute values of K, as in Examples 1 and 2 of Chapter 
II, §1, and let S be a finite subset of Mx. Then there is only a 
finite number of abelian extension of K, of exponent m, which are 
unramified over Mg — S. 


Proof: We may assume S contains the divisors of m. Let E 
be the composite of all abelian extension of K, of exponent m, 
unramified over Me—S. Let L be obtained by adjoining the 
m-th roots of unity to K, and also 7;/”,---,;/" where m, °°, 7s 
are prime elements at the discrete valuations of S. Then EL, 
viewed as an extension of ZL, is unramified over M;, by our 
lemma. We now apply the corollary of Theorem 4, the finiteness 
of the class number of Theorem 1, Chapter II, §2, and the unit 
theorem in number fields. In the function field case, the units are 
merely the non-zero constants, and the existence of the Jacobian 
guarantees that the divisor classes of exponent m form a finite 
group. Thus our corollary also works in this case. 


§3. The weak Mordell-Weil theorem 


If A is an abelian variety defined over a field K, and v is a 
discrete valuation on K, then we can define the reduction of A 
with respect to this valution [36]. We denote it by A’, and use 
the prime to denote reduction of objects like cycles, points, etc. 
In general, A} is merely a cycle. 

If A} has one component, with multiplicity 1, which is an abelian 
variety whose law of composition is obtained by reducing that of 
A, then we shall say that the reduction is non-degenerate. The 
following lemma can be proved by a systematic procedure which 
follows the one at the end of [36]. 


Lemma 2. Let K bea field with a proper set Mx of discrete valua- 
tions. Let A be an abelian variety defined over K. Then for all 
but a finite number of ve Mx, the reduction A’, is non-degenerate. 

Proof: We leave it to the reader. It consists in looking sepa- 
rately at each geometric property entering into the definition of 
an abelian variety and its reduction: The fact that the variety remains 
non-singular, that certain maps remain morphisms, that the associ- 
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ative law holds, etc. In each case, one proves that there exists 
an element a€ K* such that if |a@|.=1, then the reduction A‘, has 
the corresponding geometric property. 


Lemma 3. Let K be a field with a discrete valuation v. Let A be 
an abelian variety defined over K, and having a non-degenerate re- 
duction A,. Let m be an integer =1, prime to the characteristic of 
K and to the characteristic of the residue class field of v. Assume 
Am < Ax. Then v is unramified in the extension K(1/m- Ax). 

Proof: Since our statement is local in v, we may assume that K 
is complete under our valuation. Furthermore, it suffices to prove 
that for each a€ Ax, the extension K(1/m-a) is unramified over K. 
Let P be a point on A-such that mP=a. The automorphisms of 
K(P) over K are given by translations as in §1, and if 


fo — RP): el 
then there are m elements a, ---,d@, of Am such that the auto- 
morphisms oa; (i = 1,---, ) satisfying 

oP=P + a; 


give all automorphisms of A(P) over K. Let aj be the reduced 
points. Since m is prime to the characteristic of the residue class 
field, it follows that aj,---,a, are distinct. If g is any place of 
K(P) extending the canonical place of v, then go; are also places 
of A(P) and 
Gs(2) = (P+ ay =P! + ai. 

Hence the go; are distinct, and by Proposition 8 of Chapter I, §3 
we conclude that A(P) is unramified over K, as desired. 

From the two lemmas, Proposition 4 of §2, and Theorem 4 of 
§2, we get: 

Proposition 5. Let K be a number field, or a function field in one 
variable over an algebraically closed constant field k. Let m be an 
integer = 1, prime to the characteristic of K, and A an abelian 
variety defined over K such that An © Ax. Then Ax/mAx ts finite. 

Of course, in the function field case, we shall later eliminate the 
restriction that the function field is of dimension 1. For the mo- 
ment, we turn to the full Mordell-Weil theorem. 


§4, The infinite descent 


Let F be a field with a proper set of absolute values Mr 
satisfying the product formula, and let K be a finite extension of 
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F, with its set of absolute values Mx. Let A be an abelian variety 
defined over K, which we assume to be embedded in projective 
space by means of the complete linear system &. Let X€ SS, 
let m be an integer >1, and ma¢ Ax. Let 


o:A-~A 

be the isogeny wu = mu-+a. Then 
w(X) = (mo)'X_. = mM? X-4 = mM’ X 

where = is algebraic equivalence (cf. AV, Proposition 2 of Chapter 
IV, §1 together with the known fact that the torsion group 

D,(A)/ D(A) 
is trivial). By properties 1, 2, and 4 of heights, we see that 
Hew is quasi-equivalent to H”’, if we denote by H= Hy, the 
height function determined by our given projective embedding 
and our set Mz. In particular, if 7 is the logarithm of the height, 
then it defines a function of quasi-degree 2 on Ax in the sense of 
Chapter Ill, §4 and we can apply Proposition 7 of that chapter. 
If K is a number field, then 7 is non-degenerate by Theorem 1 of 
Chapter III, §2 and consequently we have proved Theorem 1 in 
this case. 

Furthermore, Theorem 3 of Chapter III, §4 becomes applicable, 

and thus we have proved: 


THEOREM 5. Let A be an abelian variety defined over a number 
field K, and in projective space. Then Ax is finitely generated. 
Given « >0, there exist two numbers c,(e), ce) > 0 and a number 
6,0 <6<e, such that for all B> Bye) the number of points Pe Ax 
with H(P) <= B satisfies the inequalities 

cx(e)(log B)'?~* = NB) S ,(¢)(log BY?** 
where r is the rank of Ax. 

To conclude our proof of Theorem 2, we must investigate the 
nature of sets of points of bounded height. 


§5. Points of bounded height 


Let Wc P’ be a projective variety, non-singular in codimension 
1, defined over a field k. Let w be a generic point of W over &, 
and K=&k(w) a function field for W over k. Let A =A, be an 
abelian variety defined over K, and embedded in projective space 
P” over K. Let w be a generic point of A over K. We shall 
denote by WoA the locus of (w,u) over k. Then WoA is the 
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graph of the algebraic family parametrized by W, of which A is 
a generic member. If we put Z= WoA, then 


A = Zw) = prs |Z: (ww x P*)). 


Given a rational point P of A over K, we shall denote by Wp 
the locus of (w, P) over k. Then 


lie ce \ioal Un xe 1h we le Be jee 


Note that Wp and W are birationally equivalent under the pro- 
jection pr. 
There is a projective embedding 


¢:P" x P* > Perth m@ty-1 


which to each product of points with homogeneous coordinates (x, 
+, Xr) X (Yo, +++, Nn) assigns the point with homogeneous coordinates 
(xi ¥;). One verifies immediately that for any P€ Ax we have 


Hw, P) = Hw)+ H(P)=deg W+ HP). 


Here, of course, H = Hy, is the height, taken relative to our pro- 
jective embeddings and the canonical set of absolute values on K 
obtained from the prime rational divisors of W over k. 

Let c; be a number > 0 and S a subset of Ag such that H(P)s 
c; for all PeS. Then by Theorem 2 of Chapter III, §3 there 
exists a number c, such that deg g(Wp)<c, and hence, by the 
theory of Chow coordinates, Wp can belong only to a finite number 
of algebraic families on WoA for Pe S. We shall be through with 
the proof of Theorem 2 once we have proved: 


Tueorem 6. Let W be a projective variety, non-singular in co- 
dimension 1, defined over a field k. Let w be a generic point of W 


over k, and K=k(w) a function field of W over k. Let A be an 


| 


abelian variety defined over K, and (B,t) be its K/k-trace. For each 
point Pe Ax, let Wp be the locus of (w,P) over k. If P,P'€ Ax 
are such that Wp and Wp: are in the same algebraic family on 
WA, then P and P' are congruent modulo cB, (i.e. P— P'€tB,). 

Proof: In view of Proposition 2 of §1, we may assume that k 


_ is algebraically closed. We shall prove our theorem by going from 


W, to We: passing through a generic member of the family. Our 


| first step is to show that a generic member is of the same type 
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Lemma. 3. Let U be a subvariety of WoA, defined over an ex- 
tension ki of k independent of K over k, and such that Wp is a 
specialization of U over k. Put Ki=Kk;. Then there exists a 
rational point P, of A in K, such that U= Wpe,. 

Proof: We have priWe = W. Hence by the compatibility of 
projections with specializations, we must have pr,U = W. But dim 
U=dim W, and hence a generic point of U over k, is of type 
(w, Q) for some Q€P”, algebraic over k,(w). We must show that 
Q is rational over &,(w), and that Q is in A. 

On W x P” the intersection U-(w x P”) is obviously defined, and 
in view of the above remarks, we have 


U-(w x P*) = S\(w, Q,) 


for suitable points Q;. Taking the projection on W, we see that 
d=1. Since U-(w x P”") is rational over &,(w), so is Qi = Pi. 
Finally, we have P,¢ A because 


wx P= UG Xx Po ew cA): week — wea 


Note that in the lemma, we have made no use of the fact that 
A is an abelian variety: The same statement holds for any alge- 
braic system of varieties. 

It will suffice to prove Theorem 6 for Wp and Wp,. Indeed, 
given W, and Wy, as in the theorem, there exists a subvariety 
Wr, of WoA, defined over some &;,.such that both Wp and Wp, 
are specializations of Wp,. If P— P,¢rB,, and P'—P,eé 7B,,, then 
P— P’érB,,, and by Proposition 2 of §1 we get P— P’tB,. 

The field &, may be assumed to be of transcendence degree 1 
over k. Indeed, there exists a variety T, a generic point ¢,, and 
a point ¢ of T such that W» is the unique specialization of We, 
over the specialization ¢,->¢ (over k). The specialization t,t | 
over k can be extended to a place of k(t,) corresponding to a dis- 
crete valuation ring by blowing up our point ¢. Thus there exists 
a field k, such that kc k, c R(t), R(t,) has dimension 1 over hk, 
and the specialization #;->¢ is in fact over k,. Using Proposition 
2 of §1 we may replace k by k&,, and in fact may go over to the 
algebraic closure of our constant field, without loss of generality 
for the proof of Theorem 6, and we may assume that T is non- 
singular. 

We have the following diagram of fields: 
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Let Alb(T) be the Albanese variety of 7, defined over k, and 
let f: T— Alb(T) be a canonical map, defined over &. Let 
g:ToA 
be the rational map defined over K by the expression g(t,) = Py. 
Then there is a homomorphism a: Alb(T)— A and a point ae A 
such that the following diagram is commutative: 


is 
i—— Alb) 


ae 


A 
By the definition of the A/k-trace, there exists a homomorphism 
8: Alb(T)— 8 defined over k such that the following diagram is 
commutative: 
Alb(T) 
Mae 
A\ eo JB; 


Tt 
Since ¢ is simple on 7,/ is defined at ¢. Furthermore, since Wp 
is the unique specialization of Wp, over t,—># (relative to k or 
K), it follows that P is the unique specialization of P,; over t,t 
(relative to K). Hence g(t) = P, and 
P, — P= gtt:) — gf) = aft.) + a@— aff) —a 
= tBf(t:) — tB(2) 
== Gls) 
with 6 = B[ f(t) —f(@] ¢ B.,. This proves our theorem. 
Corouttary. Let A be projectively embedded in the above theorem, 
and let the height be taken relative to this embedding and the ca- 
nonical set of absolute values Muw). Let S be a subset of Ax of 
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bounded height. Then the points of S lie in a finite number of cosets 
of TB. 

Proof: As we saw in our discussion preceding the theorem, 
points P of bounded height are such that the varieties Wp have 
bounded degree, and thus we can apply the theorem. 


§6. Theorem of the base 


Let V be a projective variety, non-singular in codimension 1, 
and defined over the algebraically closed field Rk which we may 
take to be a universal domain. The group of divisors D(V), on 
V, rational over & contains the usual subgroups of divisors which 
are algebraically equivalent to 0 and linearly equivalent to 0, re- 
spectively. 


DV )k = DAV )e = DV )y a 


The Picard group D.(V):/Di(V), has the structure of an abelian 
variety, the Picard variety. The theorem of the base states: 


TueoremM 7. Let V, k be as above. Then the group D(V), modulo 
DAV), ts of finite type. 

We shall show below how this theorem can be reduced to Theo- 
rem 2. First, let us prove a corollary. 


Coro.tuary. Let K be a field of finite type over the prime field, 
and let V be a projective variety, non-singular in codimension 1, 
and defined over K. Let D(V)x be the group of divisors on V ra- 
tional over K, and Di V)x those which are linearly equivalent to 
zero and rational over K. Then the factor group 


D(V )z/DiiV )z 
is of finite type. 

Proof: To begin with, we may extend K by a finite extension 
so that V has a rational point. Then D.(V)g/Di(V)x is represented 
by the group Ax of the rational points on the Picard variety, over 
K. If k is the algebraic closure of the prime field in K, then we 
just apply Theorem 2 and the known theorem over number fields 
to conclude that Ax is finitely generated. We can now use the 
theorem of the base to finish the proof. 

Let us now prove Theorem 7. Let L, by a generic linear variety 
over k, such that V-L, = C, is a generic curve (cf. IAG, Chapter 
7). Let J be the Jacobian of C., defined over the field A(u). Let 
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DV), be the subgroup of D(V), consisting of those divisors X¢ 
D(V), such that X-C, is of degree 0. Then D(V),/D)(V), is in- 
finite cyclic, and it will suffice to prove that Di(V)/Da(V)x is of 
finite type. 

Let ¢: Z.(C.)-—> Z.(J) be the canonical map of the 0-cycles of 
degree 0 on C, into the 0-cycles on the Jacobian. Then 


a — S(g(a)) 
maps Z,(C,) into J, and preserves rationality (i.e. if a is rational 
over k(u), so is S(g(a))). We get a homomorphism 
A: DX Vk > Jew 
given by the formula 
MX) = S(o(X-C,)) . 
According to an elementary equivalence criterion [43], one knows 
that the kernel EF of 4, which contains D,(V);, is of finite type 
modulo DV). One can in fact prove that this kernel is precisely 
DV), if we have chosen our projective embedding suitably. This 
is irrelevant here, and we see in any case that [EF + Di. V);]/DiV). 
is of finite type. To prove the theorem of the base, it will there- 
fore suffice to prove that DV; )/[E + Da(V);] is of finite type. 
The inverse image 24*(A(D.(V)x)) is precisely equal to 
E SF DAV )e . 
We have therefore an injection 
0— D(V ME + Da V ae] > SJecu/ADCV )e « 

Now let ¢: V-A be a canonical map of V into its Albanese 
variety, defined over k. For a suitable constant b€ A, we have a 
commutative diagram 

4 


Cy, — V 


v | | ¢ 
i 
4x + 6 
where i, is induced by the inclusion. 

Let us look at the inverse images of divisors in D.(V), under 
the composite maps (7, + b)og and #oi. The formalism (fog) = 
g-'of~' can be applied to the first according to Appendix 1 of AV 
since J and A are non-singular. A direct verification using in- 
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tersection theory (associativity and the definition of C, = V- Lu) 
shows that it can also be applied to the second. According to the 
definition of the Picard variety, which we know is obtained by 
pull back from that of the Albanese variety, we see that the map 


4: DLV ji > Jkt) 


induces the rational homomorphism ‘7, on Ar = Di V)el/DAV), if 
we denote as usual by the upper index ¢ the transpose of 7, on 
the Picard varieties. Consequently we have an injection 


0 DV ME + Dal V il > Jecun/ie Ar - 


By Chow’s theory of the k(u)/k-trace, one knows that (A, 4 isua 
k(u)/k-trace of J= J (AV, Chapter 8, Theorem 12). Consequently 
we can now apply Theorem 2, and the theorem is proved. 


Historical Note: 


It was Mordell [23] who discovered that the group of rational 
points (over Q) of an elliptic curve is finitely generated. (It is 
also intersting to recall that in this paper, he conjectures that the 
set of integral points is finite, and that the set of rational points 
on a curve of genus 2 2 is finite. This second conjecture is still 
unproved.) 

The extension to number fields and abelian varieties offered seri- 
ous problems, because at that time, among other things, it was 
not clear how to estimate the size of a point. In Mordell’s case, 
over the rationals, there was no problem. For this purpose, Weil 
invented his decomposition theorem and the theory of distributions. 
Actually, as was shown by Lang-Néron [19], one does not need the 
full theory to carry out the infinite descent: The definition of 
heights and the standard properties suffice. However, the principle 
of the proof has remained the same since the early days of the 
theory, first the finiteness of Ax/mAx, and second the infinite 
descent. As was said above, this was done by Weil in his thesis 
[41]. 

I have given the proof of Lang-Tate for the first. Another method 
will be found in Roquette [29], but one should note that the finite 
generation of the unit group and divisor class group and the non- 
degeneracy of a reduction for almost all p are the essential in- 
gredients of these proofs. 

As the techniques of algebraic geometry became better, and prop- 
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erties or heights more organized, the infinite descent could be 
somewhat slicked up (cf. the progression [41], [24], [19]). 

Some time ago, Severi noticed the analogy between the theorem 
of the base on surfaces and the Mordell-Weil theorem. Making 
the analogy precise, Néron in his thesis [24] proved Theorem 1 
(i.e. the Mordell-Weil theorem for fields of finite type) and also the 
theorem of the base, in any characteristic. He had to overcome 
technical difficulties in algebraic geometry, because the Picard va- 
riety was not yet available. The fact that one can extract a 
relative Mordell-Weil theorem concerning only abelian varieties was 
given by Lang-Néron [19], whose exposition for this and the theorem 
of the base we have followed here. The theorem concerning sets 
of bounded height is also taken from [19]. 

As usual, our results are purely qualitative. It is of considerable 
interest to try to push the theory in the quantitative direction, 
and to give an explicit dermination for the set of generators of 
the Mordell-Weil group of rational points. In the geometric case, 
there is a result of Igusa [14] which ties it up with the second 
Betti number (he deals with a pencil on a surface), and with the 
group of principal homogeneous spaces of the abelian variety under 
consideration. This seem to be one of the main directions in which 
the theory is developing, and shows that one can expect some hard 
questions to arise. We may recall the following exact sequence, 
as in Lang-Tate [20]. 

Let A be an abelian variety (or even a commutative group va- 
riety) defined over the field AK, and denote by A also its group of 
points rational over the separable closure of K. Let G be the 
Galois group of K over K. Then G operates on A, and if we let 
the cohomology be the limit cohomology from finite separable ex- 
tensions, we have the exact sequence 


(fey ee ee 


for each integer m> 0 prime to the characteristic of K, giving 
rise to the exact cohomology sequence 


0—> An N Ag —> Ax —> Ag —> H"G, An) —> H"G, A) 


meee A). 
the beginning which we may rewrite in the following form: 


0 —> Ag/mAx ——> H'(G, An) —> H'(G, A)nm ——> 0 
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the group on the right being the kernel in H'‘(G, A) of the multi- 
plication by m. Thus H'(G, A)m is simply the group of principal 
homogeneous spaces of A over K, of period m. If Am C Ax, then 
we get back the Kummer theory. 

Special cases when A is an elliptic curve with complex mulipli- 
cation have been treated by Cassels [4]. Local results for elliptic 
curves have been obtained by Shafarevitch [35] and by Tate [38], 
who also gives a complete duality theory for abelian varieties, in 
preparation for global dualities. One of the main problems here 
is the conjecture of Shafarevitch-Tate, to the effect that those 
elements of H’(G, A) which split at all primes form a finite group, 
(conceivably modulo those which are infinitely divisible). 

A propos of counting points on an abelian variety of height 
< B, Néron has a very interesting conjecture (dating back to the 
International Congress at Edinburgh, 1958) that for a suitable choice 
of the projective model of an abelian variety over a number field 
K, having selected a basis a, ---,a, for Ax, the logarithm of the 
number of points of type 


Aya, ate BOR Se nN,Ga, 


of height < B is described by a quadratic form. This would of 
course give much more precise information than Néron’s estimate 
given in Theorem 5. 

I can write down formally a quadratic form, of which it would 
be interesting to know whether it coincides with Néron’s. Indeed, 
consider the maps 


Jl 3é gn0 3 Al os Al SK ooo & AL ss A 


the first one being (7,0, ---,”,0) and the second the sum. Using 
AV, Proposition 2 of Chapter IV, §1 let X be a hyperplane on A, 
and take its inverse image by the two maps. This leads one to 
consider the form 


D [nx(ai + a3) — x(a) — gx(a;)\nin; 


where yx is the logarithm of the height. 

It would also be interesting to investigate the mapping mé in 
an algebraic system of abelian varieties over, say, a number field, 
and to see whether the constants involved in the equivalence H(mP) 
~ H(P)"’ can be chosen uniformly for the system. 


CHAPTER VI 
The Thue-Siegel-Roth Theorem 


We shall give an exposition of this important theorem under 
axioms which are valid in number fields and function fields, namely, 
the product formula and a weak Riemann-Roch condition. We also 
need characteristic 0. In §1 we state the theorem. In §2 we re- 
formulate it by a formal argument in a more manageable form 
(because we consider approximations at several absolute values). 
This should not be confused with the deep problem of simultane- 
ous approximations at one absolute values (cf. the historical note). 
In §3 we sketch the proof, basing it on two propositions. We then 
prove Proposition 1 in §4 and §5, and Proposition 2 in §6, §7, §8 
and §9. Finally, in §10, we give a geometric formulation adapted 
to the applications to algebraic geometry and possible generalizations 
to varieties of arbitrary dimensions (cf. again the historical note). 

The proof of Theorem 1 is self contained and elementary (the 
use of the Gauss lemma of Chapter III obviously fits this descrip- 
tion). 


§1. Statement of the theorem 


Throughout this chapter, we shall assume that K is a field with a 
proper set of absolute values Mr. 

(As we deal with a fixed field K, we could omit the condition 
that the absolute values are well behaved: It will never be used.) 

We denote by S..a fixed finite, non-empty subset of Mx, contain- 
ing all the archimedean absolute values, if there are any. If S is 
a finite subset of Mx, we denote by Js the subset of elements 
x€K such that 


[vle sl for all véS 


and call this the S-integers. If SDS. then Js is a ring, taking into 
account the non-archimedean nature of the absolute values outside 
S. If S=S. we also write J. or J instead of Js,, and call this 
simply the integers. We shall always assume that K is the quo- 
tient field of J.. 


on 
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If all absolute values of Mg are non-archimedean, then the set 
of elements x¢ K such that 


[et ee ee for all ve Me 


is a field, and all its non-zero elements have in fact absolute value 
1 for all ve Mg. One sees this immediately from the non-archime- 
dean property and the product formula. This field will be called 
the constant field and will be denoted by k. Given an Mg,-divisor 
dD one sees that the set L(d) consisting of those elements x¢ K 
satisfying |x|, <d(v) is a vector space over k, again by the non- 
archimedean nature of the absolute values. 

The following two axioms which we shall consider are very weak 
forms of Riemann-Roch theorems. 

Axiom la. There exist numbers C,, C2, By > 0 depending only on 
K and S. such that for all B= By, the set of elements x€I. such 
that 


ele ee oo all ve S. 


is finite, and the number of elements 2(B) in the set satisfies the 
inequalities 


Ce a4 26.5 


where N= Syes,,No- 

Axiom lb. All absolute values of Mrz are non-archimedean. There 
exist numbers C,, C2, By) > 0 and an integer d > 0 depending only on 
K and S. such that for all integers v > 0 the set of elements x€ In 
such that 


[lee 25 all ves. 


is a finite dimensional vector space over the constant field k, whose 
dimension I(B>) satisfies the inequalities 


Cy =1G,) = ee 


We note that Axiom la is a consequence of Theorem 2, Chapter 
II whenever K is a number field, and S.. the set of archimedean 
absolute values. If K is a function field in one variable over the 
constant field k, then Axiom 1b is a consequence of the Riemann- 
Roch theorem, which asserts something much more precise. In 
that case, if k is algebraically closed, the multiplicities turn out to 
be equal to 1 with suitably normalized absolute values, namely, 
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| x1, = (1/e)™v* 


e being the constant 2.718... . 

Axiom 1b also applies to higher dimensional projective, normal 
varieties. One selects an absolutely irreducible hyperplane D to 
determine the discrete valuation at infinity. After taking a high 
multiple D, = »D, one knows (from elementary facts) that J(yD,) is 
a polynomial of degree d in », so that Axiom 1b is satisfied. We 
shall use Axiom 1b to prove the next theorem. In the applications, 
one can also reduce the higher dimensional case to that of curves 
by induction (cf. [18]). In our axiomatic setup here, we may as 
well deal simultaneously with all cases, especially in view of the 
very elementary proofs which can be given for our axioms in the 
special cases of interest to us. 


TueoreM 1. Let K bea field of characteristic 0, with a proper set 
of absolute values Mr satisfying the product formula with multipli- 
cities N, 21. Let S. be a finite, non-empty subset of Mx containing 
all the archimedean absolute values. Assume that K is the quotient 
field of I... and that Axiom la or 1b ts satisfied. Let S be a finite 
subset of Mx. For each véS let a, be algebraic over K, and assume 
that v is extended to K in some way. Let « be real > 2. Then the 
elements BE K satisfying the approximation condition 


: 1 
poe, Ila —8 ||) S H®* 


have bounded height. 
As in Chapter III, we use the height 


Hx(8) = H(8)= JI sup, |] 8 lle) 


vEMR 


and || 8 |lo=|8 |)”. 


It is useful to make a few remarks concerning our statement of 
the theorem. 

(i) The theorem is strengthened the smaller we make the left 
side and the larger we make the right side of our inequality. In 
particular, the smaller we make «, the better the theorem. (It is 
known, however, that 2 is a best possible result.) The larger we 
make the set S, the stronger the theorem. 

(ii) The right-hand side 1/H(8)“ could be replaced by C/H(8)* 
where C is a fixed number. Indeed, if there were §’s satisfying 
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the modified inequality with C, whose height tends to infinity, then 
writing «=«, + & with & >0 and «x, > 2 we have 


ee 
H(B) H(8)" = H(8)y 


and for large H(8), we see that C/H(p) <1. It is a very interest- 
ing question to estimate in number fields the number of solutions 
of the inequality as a function of C, asymptotically for C > 9. 

(iii) Suppose that 8 ranges through a sequence of elements of 
K such that || a, —8||, tends to0. Weknow from Chapter I that 
the extension of v to K corresponds to an embedding of K into K». 
In particular, viewing a, as embedded in K, and thus algebraic over 
K,, if a, is approximated very closely by elements 8 of A, then 
a, must in fact lie in Ky. 

(iv) If a,a’ are two distinct elements of K, for some v, and if 
8 approximates a, then 8 stays away from a’. As § approaches 
a, its distance from a’ approaches the distance between a and a’. 
Hence it would add no greater generality to our statement if we 
took a product over several a, for each v. 

(v) There is no reason not to let 8 approach infinity, and we 
could add to the approximation condition on the left-hand side a 
product 


Tf supd, || 1/8 ||.) . 
ves 


This version of the theorem can be reduced to the other by making 
a linear projective transformation 7(8) = (a8 + b)/(c8 + d) such that 
the transform of 8 approaches elements at a finite distance for all 
veéS, if it converges at all. It is trivially checked that if T is a 
non-singular transformation with coefficients in K, then AH(T(B)) is 
equivalent to H(), i.e. each is less than a constant multiple of 
the other. 

(vi) If @ is in K, and S contains one absolute value, then the 
theorem is trivial, and with « > 1 instead of « >2. Indeed, one 
can even replace the finite product on the left by the complete 
product over all absolute values of Mx, in which case one gets the 
reciprocal of the height of (a — 8) which is equivalent to the height 
of 8. From this the assertion is clear. 

It is known that if one puts conditions of a multiplicative nature 
on the §, then it is possible to lower the value of «. This is the 
way one can interpret some known variants of the theorem, for 
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instance as in [28]. Actually, these are immediate consequences of 
the theorem as we have stated it. As an illustration, we give the 
proof of a theorem of Mahler. 


Corotiary. Using the same assumptions as in Theorem 1, and 
assuming in addition that the elements 8 of K are subject to the 
condition 


Hetl=1 


for all v € S, the conclusion of the theorem is valid if x > 0 (instead 
Ojm > 2). 
Proof: In view of our additional assumption, we have 


A(8) = eae |B lle) = IM ee 1/8 |)". 


Similarly, using H(8) = H(1/8), we have 
A(8) = TJ inf(, || B ||". 


ves 
Using Remarks (iv) and (v), and assuming say, that no a, = 0, we 
know that if € > 2 then the elements 8 of K satisfying 


[J min, |] a — 8 lls) TI] minG, 11 8 lle) [min G, || Ve ll) S ee 
vES ves ves (B) 
have bounded height. Using the equalities just derived, our as- 

sertion is now obvious. 

Finally, before starting the proof, it may be illuminating to 
consider the special case where K = Q and the set of absolute va- 
lues is the ordinary set of absolute values. Then the multiplicities 
are equal to 1, and Axiom la is obviously satisfied. 

When only one absolute value is considered, say the ordinary 
one, then the inequality has frequently been written 


taking m,n relatively prime integers. Since m/n comes very close 
to a, the two integers | m |, || must have the same order of magni- 
tude, so that instead of taking 1/| |“ for the right-hand side, one 
can replace |”| by sup(|m|, ||) which is precisely the height of 
8 =mln. This identifies the most classical form of the theorem 
with ours, taking into account Remark (iii), the completion being 
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the real numbers. 
Suppose we consider a p-adic absolute value. Then the term 
corresponding to p is sometimes written in the form 


min(1, | za — m|,) 


following Mahler. If | a — (m/n) |» becomes very small, then | m/n |p 
cannot tend to infinity. It is bounded from above by a number 
depending ona. In particular, | ™ |, and | |, lie between two fixed 
bounds depending on a, and thus having 


min (1, | za — m|>) 


is no improvement to having 


min (1, la —_— 
n 


at) 


in the product. 

We have thus seen how our formulation covers the classical for- 
mulations and their variants, and we are now ready to prove the 
theorem. 


§2. Reduction to simultaneous approximations 


Rather than deal with the approximation condition in the form 
in which we have given it, i.e. with a product, it is easier to deal 
with a simultaneous set of approximations for all v¢ S. Thus we 
reformulate our theorem in the following manner. 


TueoreM 1’. Let the hypotheses be as in Theorem 1. For each 
veéS, suppose a real number 2, = 0 is given such that 


MA=l1. 


ves 


Then the elements 8 in K satisfying the simultaneous system of ine- 
qualities 


a eee ee 
inf (1, ||@ — 8 ||») S Ha 
for all ve S have bounded height. 

It is obvious that Theorem 1 implies Theorem 1’ (take the pro- 
duct). We shall now show that Theorem 1’ implies Theorem 1. 
Suppose we have a sequence of solutions 8 to the approximation 
condition of Theorem 1, whose height tends to infinity. For such 
B, write 


[VI, §2] REDUCTION TO SIMULTANEOUS APPROXIMATIONS 97 


: 1 
inf (L, ll ee ~ B lbs) = rae 
with a real number £,(8) =0. Then by hypothesis, 
Dee Ae) = 1. 
ves 


Now select «’ such that « > «’ > 2. Choose a positive integer A 


such that 
A = - 1) >s 


where s is the number of elements of S. Using induction, and the 
obvious fact that [x + y] S [x] + [y] + 1 (the bracket being the largest 
integer <) we get 


oe pie see 
Atssa+ s[ 54550) = 3,| aSe@ |+s 
whence 
Kk 
As>, [4 Z £18) ; 
Consequently there exist integers a,(8) = 0 such that 
a(8) < [ AS E48) | < AXE,8). 


and $)a.(8)= A. From this we see that there is only a finite 
number of possible distributions of such integers a,(8), and hence, 
restricting our attention to a subsequence of our elements f if ne- 
cessary, we can assume that the a,(8) are the same for all 8. We 
write them a,. We then put 


Ae = Gs) 

so that 0<4,<1 and 5}4.=1. For each @ in our subsequence 
we have «’d, < x€,(8), and hence these 8 satisfy the simultaneous 
system of inequalities 

’ 1 

inf IL, Go — ae Se 

( || H(8) Ay 

We can therefore apply Theorem 1’, and have therefore shown what 


we wanted. 
We make one more remark which is technically useful in the 
proof. We may assume without loss of generality that all the a, 
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are integral over J... Indeed, the a, satisfy an equation 
GnY" +++ +a =0 


with a;¢K. (Take the product of their irreducible equations over 
K.) Using our assumption that K is the quotient field of J. we 
can clear denominators, and thus assume that all a; lie in J~. Mul- 
tiplying this equation by a, ’, we see that a,a, is integral over J. 
for each v. Now if 8 approaches a,, then a,8 approaches a,a, eS- 
sentially as well, and we can use a trick as in Remark (ii) to show 


that we may deal with @,a, instead of a,. This proves our assertion. 


§3. Sketch of the proof 


Let K be a field, and F(X)¢€ K[X, -+:, Xn] = K[X]a polynomial 
in several variables with coefficients in K. Let 7, ---,%m be in- 
tegers (in Z) =O and assume that the degree of F in X; is <7;. 
For any variable ¢, we have a Taylor expansion 


F(X1, ++, Xm) = SPO, +++, OE — Xi) +-E — Xn) 


the sum being taken over (7) = (1, ---, im), that is, m-tuples of in- 
tegers satisfying OS 7:57, -°--,O0 Sin S 7m. Each F™ is a poly- 
nomial, namely 
; 1 
a) =-— : 
1,! oa ! 


™m 


O:. 5 O01." F 


and 0,,---,0m are the partial derivatives. Thus the coefficients of 
F® are integral multiples of the coefficients of F, since the fac- 
torials in the denominator divide the integers which will occur as 
a result of repeated partial differentiation. Furthermore, such in- 
tegral multiples will be bounded by the product of the binomial 


( ) i ( ) 
ay Ci 


which is itself bounded by 2"!*"°*’™. This remark will be of use 
later when we estimate the absolute value of F. We also observe 
that the number of terms in our sum is certainly bounded by 

(71 +t 1) +++ Om +1) 


which is also bounded by 27!*°°'*"™, 
We shall now state two results, to be proved in the following 
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sections. We shall then indicate how Theorem 1’ follows from 
them. 

If we have K, S., J. as described in §1, and a polynomial 
G(X), +--+, Xm) in I[X] then we write 

BAG) = sup |G, 

the sup being taken over allveS.. It isa bound for the absolute 
values of the coefficients of G in S.. 

We shall meet in the statements and proofs which follow certain 
constants depending on the original data. We number them con- 
secutively and always indicate what they depend on. 


Proposition 1. Let K be a field with a proper set of absolute 
values Mx. Let S. be a finite non-empty subset of Mx containing 
all the archimedean absolute values. Let I. be its integers. Assume 
Axiom la or 1b is satisfied. Let f(Y) be a polynomial in one variable 
in I..[¥Y| with leading coefficient 1 and of degreen. Let 1,72, +-- be 
integers > 0, and let 0<e< 1/2. Then for all m> (2n/e) (orm > 
(2n/C,e)’ in case of Axiom 1b) there exists a polynomial 


GON =, Ae) EO 
in I.([X] satisfying: 


(1) a@ee G im XA; is S473. 
(2) If +. +sm(5—s) 
i Vm 2 


then for any root a of f, we have 
G"(a,---,a=0. 
(3) BiG sO, = wncre 0, — C,(K, Sa.) ) 
Remark. From now on, all constants C,, Cs, --- depend on K, Sx, 
f only, unless otherwise specified. In each case, they can be easily 


evaluated. 
The second proposition will give us a means of preventing a 


polynomial from vanishing at a point (f, -+:, Bm). 

Proposition 2. Let K be a field of characteristic 0, with a proper 
set of absolute values Mx satisfying the product formula with multt- 
plicities N,=1. Let m be an integer >0. Let 0<6<1/16"*”. 
Let G(X, --+, Xm) #0 be in I..|X] of degree S rj in X;._ Assume that 
the r; satisfy 
(4) Or: >72,°-+,Or%m >10 (we say they are 6-decreasing). 
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Let B,, +++, Bm be elements of heights H;,---, Hn respectively satisfy- 
ing 

(5) Nlog4+4m S dlog A, 

(6) 7, log Ay = 7; log A; for 7 =i)". . 

Assume also that G satisfies 

(e BiG Ho, 

Then there exists an m-tuple (1, +++, ¥m) such that 

(8) vifr, + 2+* + ymin S 20)" /™ 


and G™ (Bi, +++, Bm) # 0. 

Using Propositions 1 and 2, we show how to prove Theorem 1’. 

We assume that we have elements 8 in K satisfying our simul- 
taneous inequalities whose heights tend to infinity, and show that 
given e, we must have «<2+e’ where ec’ is small when e is 
small. We can of course assume that e < 1/10. 

We choose m, 0, 81, -°-+, Bm, 71, ‘**,%m in that order, subject to 
certain conditions, as follows: 

We first select m > (2n/e)Y or > (2n/C,ey so that we can apply 
Proposition 1. We then select 6 < e so that (20)"6"’»™” is also less 
than e. 

We now select #, of height AH, so large that (5) is satisfied, and 
that d log H; > log C; (the constant of (3)). 

We select $2, ---, Bm of heights H2,---, Hm respectively such that 
they increase logarithmically d-rapidly, that is to say 


26 log H, > log A, 


20 108 Hoe Al,.-¢ - 
We choose an integer 7, so large that 


710 log H, > 10-log H; formally —2),---m 


and then integers 72, --+,7%m such that 
log H, , . log 7, 
r = 7; < || aA 
‘cee * is "og H; 


It follows immediately from the above inequalities that the 13 
satisfy the condition 
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(9) r, log Hi <7; log H; S (1 +e) log H, 


even with [1+ (1/10)é] instead of (1+) on the right-hand side. 
From this one concludes that (4) is satisfied, i.e. the 7; are d-de- 
creasing, namely, 

75[7 541 = (15/71) Milr41) 


and using (9) we get (4) at once. 

We now apply Proposition 1. We had seen at the end of §2 that 
we could assume the a, to be integral, satisfying a polynomial of 
degree » over K with leading coefficient 1. We find a polynomial 
G as in that proposition. By condition (3) we conclude that 


BUG) 2A," . 


By Proposition 2, some derivative G” =F is such that F(8) #0 
and v,/7; + +++ Ym/%m Se. Consequently, by (2) it follows that if 


tilt, + +++ + tm/tm S mA/2—e)—e 
then for any root a of f(i.e. any a,) we have 
F™ (a,+:-,a)=0. 
We estimate || F(8)||, for v¢S such that 4, #0. In the expan- 
sion 
F(B) = SF (ae, +++, @y) (ay — Bi)! +++ (ty — Bm)™ 
all the terms will be 0 except those belonging to an (7) such that 
11/7, + +++ + tlm 2 m(1/2 — 2e). 
The total number of terms in the sum is bounded by 
grits--+rm < gery < Ao 
since the r; are decreasing. An upper bound for | F (a, +++, @) |» 
is certainly 
iON CN (Cae Ga 


using the fact that we chose H, large. The constant C; need merely 
be a bound for the value of a, at v. Thus putting everything to- 
gether, we get 

|| F(B) lo = Hi""** sup || ay — Bi Ilo +++ ||» — Bem I” 


the sup being taken for those (2) satisfying the condition mentioned 
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above. Using the hypotheses of Theorem 1’, we get 


5 

8bmryNy 

| F(8) lle S He Sai (Hi... Hime ‘ 

We now replace i by 741/71, °-+,im DY Ymim/?m. We then replace 


Hii by Hj using (9), making the denominator smaller and the 
fraction bigger. We then replace 7,/r: ++-+++ tm/%m by m(1/2 — 2e) 
thus finally obtaining 

1 


Jor (1/2-28) KAy * 
1 


Gell, 2 ee 


If ve S and 4, = 0, we estimate || F(8) ||, naively, and get 
\| F(8) lp S H2°""'*" sup(1, {| Br ||o)++-sup(l, || Bm ||) 


using the same kind of crude estimates as before for the number 
of terms, coefficients, etc. 
If v ¢ S, so that v is non-archimedean, then we simply get 


| FB) ||» S sup(1, || B: |le)*++-sup(1, |] Bm |lo)™ . 


We now take the product over all ve Me. We almost get the 
heights of the 8; on the right, up to a finite number of factors. 
We make the right-hand side only bigger if we add these factors. 
Having done so, we use (9) which guarantees H7i < H}'"*" and 
thus obtain 


Hee ae (1+2) 
Ml Il (8) Ile SS Frere . 


vEMK 


Since F(8) #0, the left-hand side is equal to 1, and we take logs. 
Having done so, we cancel mr,, and get 


e522) SON +1 +6. 
From this we immediately get « < 2+’ as desired. 


§4. A combinatorial lemma 


Let r1,+++,1%m be integers = land lete >0. The number e of sets 
of integers (i1,°*-,im) satisfying 


and 
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ee 


‘; Vn 


does not exceed 
il 
wre! +1) +++ (%m +1) 


Proof: By induction on m. The assertion is trivial if m= 1. 
Take m>1. We write 


it! a. 
m( ; :) =m 2) 
so that me = 2/2. In terms of 4 our upper bound reads 


Q2m)/? 


A 


(ge aaa = lee 


Our assertion is trivial if 4< 2m’. We may therefore assume 
that 2 > 2m’. For each in, 7m fixed, we consider the solutions of 
i 


1 oe a 
sy. ES (m—2- 
T; Vm-1 ‘m 


m2 
i Ae ial hn 


a 2 


We apply the induction hypothesis and sum over 7m. For this, we 
first make a computation: 


é 2 ae 1 1 | 
= Se | een 


oa: 2) 
a DT sears — 2i/ry’ 
ay 2h 
<a ——— = 1 
Seg! * a1 


We now let i=, andy =/7m. Our number é is therefore bound- 
ed by 


1/2 
Se es ee a a)> *(%m—1 + 1) 
< 2A(m — ibis 


ee Oe ers +1). 


We must therefore check whether 
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240m = 1) 2m‘? 
7-1 UA 

or equivalently, whether 

lm — 1)/m]'? < (2 — 1/2’. 
Observe that (1 — 1/m)'/”? < 1—1/2m (square the right-hand side). 
It suffices therefore that 1—1/2m<1-—1/4°. But this is true in 
view of our original hypothesis on 4, and our lemma is proved. 

§5. Proof of Proposition 1 | 


We have to use either Axiom la or lb. Let us start with la, 
and indicate afterwards the modifications of language necessary to 
use Axiom 1b instead. 


The number of polynomials F(X, ---, Xm) such that 
(10) deg F in X; is S17; 
a) Fe I. [X } and BAF) = B 


is at least (C,B)¥'‘1t? "mt? For each set of integers (7) satisfy- 
ing the condition of (2), we let RF be the remainder of the divi- 
sion of F(Y,---, Y) by f(Y). The degree of F(Y, ---, Y)(which 
is a polynomial in one variable) is at most 7, +--+ +7m. Using 
the remarks at the beginning of §3, and long division, we find by 
an inductive estimate that 


(12) BuARF®) Ci “aR 


where C; depends on f and S. only. In this way, each polynomial 
satisfying (10) and (11) gives rise to a set of e remainders 


(RES sey RE 
where é is bounded as in the preceding section. Each remainde 


is of degree < n. 
By Axiom la, the number of such sets of remainders is at mos 


(13) Oar a I8 lee : 


We want two distinct polynomials with the same set of remain 
der, say F,G. Then F —G will have remainders 0 for all (i) satis 
fying (2), and thus the conclusion of (2) will be satisfied. Furthermore; 
B.(F — G) will be at most 2B, and thus (3) will also be satisfied. 
Condition (1) is trivially satisfied since we started with (10). 
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We shall certainly get our two distinct polynomials if 
(14) (Cibo a es 5 [en* Ts) 3 lahiag 


and this will certainly be true if we replace e by the upper estimate 
derived in the preceding section. The product (7; + 1)--+(%m +1) 
in the exponent of both sides can then be cancelled. Let C,;=C?. 
It will suffice to get 


(15) Ce > (Ci eR yee . 
If now m > (2n/e)’, then it will suffice to get 
(16) Cy ee (Cs ee |? 
and thus it will in fact suffice to get 

(17) BEE SO ae, 


with a suitable C;, Letting Cs; = C7/" we get a condition of the 
type desired to satisfy (3). This concludes the proof, using Axiom 
la. 

To use Axiom 1b, we have to count linear dimensions. 

The polynomials F(X) satisfying (10) and 
(11b) Fele[X] and B.(F) < B} 
are a vector space over the constants k of dimension at least equal 
to (71+ 1D+-(%7m + DCw*. 
_ For each set of integers (2) satisfying the condition of (2), we 
find the same estimate (12) (without bothering to take into account 


the fact that we are dealing with non-archimedean absolute values). 
The map 


F->(RF™}, eee, RF“ *) 


s now a &-linear map. By Axiom 1b, the dimension over k of the 
‘ector space which is the image of this map is at most 


13b) ne [vol sp Bad sy i +p]? . 


‘he integer v) is determined so that C; < By°. We now want the 
ernel of our &-linear map to be larger than 0, and for this it will 
suffice to have the inequality of dimensions: 


(14b) (ey Ca? = na—(r, aa eng eeei 


+ (vo(rr t t+ Ym) + y]?. 
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We cancel the product of the (7; + 1), and take m > (2n/C,e)’. We 
then see that it suffices to achieve 


> Dbvlts to tim) +0 
and taking a d-th root, it is enough to get 
(1 ai) > Pita 72? + Femi 


Putting this condition in terms of B.(F), we see that our proposi- 
tion is proved. 


§6. Wronskians 


Let R be a ring. A derivation d of R isa linear map of FR into 
itself satisfying 
d(xy) = x-dy + y-dx. 


Those elements x of R such that dx =0 form a subring (because 
d(1) = 0) containing the prime ring, and are called the ring of con- 
stants. This ring of constants is a field if R is a field. A differ- 
ential ring is a ring together with a finite set of derivations d,,---, 
d, which commute with each other (as operators on R). Its sub- 
ring of constants is by definition the intersection of the rings of 
constants of each d;. 

Example. Let k be a field, and RF the polynomial ring kLX,,---, Xn]. 
Let d; be the partial derivative with respect to X;, taken formally. 
Then R is a differential ring, and if & has characteristic 0, it is 
the field of constants. 

If R is a differential ring, and d,,---,d, its derivations, we can 
form monomials 

Bein s (ee 
where iy are integers 20. These monomials can be viewed as li- 
near operators on FR in the obvious way. The degree of the above 
monomial is by definition 7; + --: + 7,. Wedenote by D(r) the set 
of monomials of degree < 7, where 7 is an integer = 1. 

The following proposition will be applied to the case* of poly- 
nomials in several variables, as above. I am indebted to Kolchin 
for the formulation and proof in its present degree of generality. 


Proposition 3. Let F be a differential field, k the field of constants 
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for the derivations d,,---,d, of F. Let D(r) be the monomials of 
degree <r in the derivations. Let x,,+-+,%n, be elements of F. If 
they are linearly dependent over k, then 
det (4;x;) = 0 

for all operators 0,,+++,0.€ D(r) and any r. Conversely, tf this de- 
terminant is 0 for all choices of 0;¢€ D(i), then they are linearly 
dependent over k. 

Proof: The first assertion is trivial. We prove the second by 
induction on 2, and the assertion is trivial for »=1. We assume 
nm>1. Observe that 6,=1. We consider determinants: 


Ga OX. =yeus Oier 
6;€ Di) - 


OnX1 OnXe-°* OnXn 


If all the subdeterminants of size m — 1 above the last line are 0 
for all choices of 6;¢ D(z), then any m — 1 among the x’s are linearly 
independent by induction, and we are done. Thus we may assume 
that some determinant of size  — 1 above the last line is not 0, say 


x1 APY MEO Xn-1 
64%, 83X2 oD 05% aa 
i #0 
Re a RG Gre itni 
for some choice of 6;¢ D(@),i=1,-+:,n—1. 


We complete this determinant to one with rows having ” elements. 
Then these rows are linearly independent over F since the smaller 
rows are already linearly independent over Ff. Hence any row 

(0x1, +++, Xs) G€ D(n) 
is linearly dependent over F on our expanded ” — 7 rows, and thus 
the matrix 


xX eee Xn 
Obx, ee O3 Xn 
(RE er 0 (ee 
Ox Sa6 OXn 


QX1 see PXn 
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has rank m —1 for any choice of 6,¢,--- in D(m). It therefore has 
column rank equal to  — 1, and hence there exist elements a,,---, 
a,¢€F with a, =1 such that for all 6€ D(n) we have 


(18) a,Ox, ap oor nO 0 


because the last column can be expressed as a linear combination 
of the first (2 —1) columns. Let d be any one of the derivations 
d;. Applying d to (18) we get 


a,(dOx,) + +++ + An(dOxn) + (da,)Ox%1 + +++ + (dan)Ox, =0. 


In this last expression, let A(@) and B(@) denote the first sum 
involving the dé, and the second sum respectively. We have 
A(é) + B(@)=0. If @ happens to lie in D(m—1) then dé lies in 
D(n) and hence A(#)=0 by (18). This holds in particular for 
everyone of the derivations @; and gives B(@;)=0 for all 7. This 
is a set of m —1 equations involving ~ — 1 variables 


da,, Sees dan-1 
because @, =1 and da, =0. The determinant of the coefficients 
is not zero, and hence da; = --- = da,-1=0. Since d was any one 
of the derivations d; it follows that a, --+,@,-; are constants. 


Letting 6 =1 in (18) proves what we wanted. 


§7. Factorization of a polynomial 


Although it is not true that a polynomial in several variables 
factorizes into a product of polynomials in fewer variables, we shall 
see, nevertheless, that a suitable derivative of it does. This will 
allow us to carry out an induction argument later. 

Let K be a field of characteristic 0. We consider the polynomial 


ring K[X.,---, Xm] = K[X] with the standard partial derivatives. 
By a differential operator D we mean 
oe we @im 


ll 000 fen 


and we call 7:+ ++ +%m the degree of the operator. Applied to 
any polynomial G, it yields another polynomial whose coefficients 
are integral multiples of those of G. 


Lemma 1. Let K be a field of characteristic 0. Let G(X)= 


[VI, §7] FACTORIZATION OF A POLYNOMIAL 109 


G(X, +-+,Xm) be a non-zero polynomial in K[X] in m variables, 


m = 2, of degree <r; in X; for eachj =1,---,m. Then there exists 
an integer I satisfying 
(19) PETS 7. + | 


and differential operators D,+--, Di-1, on the variables X,, +++, Xm-1, 
of degrees <0,---,/—1, respectively, such that if 


RCGr ee esac (0.086) Ce ees 


then F +0, and 
F(X, See ENae) 3 UX, aaa oe Aoi) VOXGn) 


where U,V are polynomials, U is of degree at most Ir; 1n X; for 
j =1,---,m—1 and V is of degree at most Irn in Xm. 
Proof: We consider all representations of G in the form 


G(X) = Gol Xm)Pol Xs, +++, Xma1) +s + Gis Xm)Pi-i(Mi, +++, Xm-1) 


where the ¢g, and ¢, are polynomials with coefficients in K, of de- 
gree at most 7; in X;. Such a representation is possible, for in- 
stance, with /—1=r7,, and g(Xn)=Xm. From all such representations, 
we select the one for which 7 is least. Then 


Pi Xm), -* +, Pi-1Xm) 


are linearly independent over AK, for, if not, one would immediately 
deduce another representation of G as above with a smaller /. 
Similarly, 

b( Xi, pa Gea); aia di-(X1, eae) Xm-1) 


are linearly independent over K. AlsolS/ 57,41. 

We now apply Proposition 3 to these two sets of polynomials. 
The 6’s in Proposition 3 were taken with coefficient 1. Since we 
are in characteristic 0, putting a factorial in the denominator re- 
sults in no loss of generality, and hence there exist differential 
operators Do, --:, Di-1, in the variables X,,---,Xm-1, of degrees 
<0,---,/—1, respectively, such that 


UX, Ses P.O) a det (Du) ae o> 0, TaD Ue 1) 


is not 0. Similarly, 


V(iXm) = det(—- axe, ) = 0. taal = 1) 
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is not 0. Multiplying these two determinants, we see that UV= 
F. The degrees of U, V in the X; obviously satisfy the required 
condition, since the determinants are 7 x I. 

We now estimate the coefficients of F in terms of those of G. 


Lemma 2. Let the notation be as in the previous lemma. Let v 
be an absolute value on K. If v is non-archimedean then |F \,<|Gli. 
If v induces the ordinary absolute value on Q, then 


(Pipa 1Gr 1s Ge SDN 2 eee 


Proof. If v is non-archimedean, our assertion is obvious. If v 
induces the ordinary absolute value on Q, we regard G as a sum 
of (7; + 1)---(%m +1) terms, each of the form 


IPO Gs 


where |a@ i) |S |G|. Wecan develop our determinant into a sum of 
(7: + 1)---(7m + DJ’ determinants, the general element in each such 
determinant being of the form 
z i y ay a 
ai) Du—- Om Le, G5, 

pens 
By the remarks at the begining of §3, we see that the effect of 
the differential operators on the estimates is at most to multiply 
them by 2"'*'*"™. Hence the coefficients of each of the /! terms 


in the expansion of an individual determinant have absolute values 
not exceeding 


[| G grit ttm! 


and our assertion follows. 


§8. The index 


Let K be a field, F(X, +--+, Xm) a polynomial in K[X], and F+0. 
Let 71, °++,%m be numbers > 0, and fi, ---, 8m elements of K. To 
simplify the notation, we denote by 7m, and Bim, the vectors of 
the r’s and #’s respectively. 

Let ¢ be a new variable. Consider the expansion 

F(B, + Xyt'/", «++, Bm + Xmt'!*™) 


= SP e:. vee, Bm)Xit- , X impis/rrt tim/ tm ; 


We can view the right-hand side as a polynomial in fractional 
powers of ¢t. We let 
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9 = indr in) Bim i 


be the smallest number occurring as an exponent of ¢ on the right- 
hand side such that ¢® has a non-zero coefficient, and call it the 
index of F at B.,,, relative to7,,,,.. Then @ is also the minimum of 
the numbers (i,/7; + -++ +in/%m) such that F'(@,, +++, Bm) #0. 

We note that the index is always = 0, and equals 0 if and only 
if F(B,,-+-, Bm) = 0. 

It follows immediately from the definitions that the index satis- 
fies the following properties: 


Inv. 1. Jf F,G are two polynomials, neither of which is 0, then 
ind(F + G) = min (ind F, ind G) 
ind(FG) =indF+indG. 


(because the degree of a product is equal to the sum of the deg- 
rees). Jf F is a polynomial in X,, +--+, Xm-1 and G ts a polynomial 
in Xm, then 


ING + im) , Bim \G) = ind, ea Pine x IND rin Bn(G) C 
Inp 2. The index of 


Oy... O4mE 
at B.,, relative tO rm, 1s at least equal to 
nde) Fan) — Bilti — +> — PmlTm » 
for any integers v1,+++,¥m 20, provided the derived polynomial is 


not identically 0. 
Inp 3. Let 1 be a number > 0, and Irn, = (ln, +++, lr,). Then 


IND tr (my 8 (my (F) ae ind iq) Bom) F) 3 


We now wish to compare the indices of G and F in Lemma 1. 
Lemma 3. Let the notation be as in Lemma 1. Let 0< 681. 
Assume that 
(20) Orn seo Oi = 10. 


Let B:, +++, Bm be elements of K, and put ind = indy m),Bim)- Let O= 
ind(G). Then 


6 oor 0 =< ind (F) +43. 
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Proof: Consider a differential operator of the form 


Da, = ee Se a om! 


1! et! 
on Xi, °-:,Xm-1 of degree </—1. If the polynomial 
D,L0%.R 
y! 


does not vanish identically, its index at @,,,. relative to 7, is at 
least 


a 4/11 Be ie oe let Tae — vltm 5 
We make this number smaller if we replace the 7; (7 = 1, -+:, m—1) 
by 7m-1. Hence this index is at least 
I-11 y Vm 
(2S —p/tm. 
Gl ae Ym —1 hile 


Since 7m/7m-1 < 6, and since the index is never negative, it must 
be at least 


max (0,4 — v/rm) — 6. 


If we expand the determinant giving the expression for F, we 
obtain for F a sum of /! terms, the typical term being of the form 


1 = 
e (DusQ)( Day = On G) = (Dar aan” G) 


where the D, have degree </—1. By one of the basic properties 
of the index, if such a term does not vanish identically its index 
is at least 


Al 
o = 3) max(0, 0 — v/rm) — 16 . 
v=0 
Since F is a sum of such terms, and is not 0, it follows that 


ind(F) 2 o. We must now solve for 6 in terms of ind(F). We 
can assume 67, > 10, for if not, we have 


62 10/rn <6 


and our inequality is certainly satisfied. 
If 07m <1 then we have 
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l—0 l- 
>) max 0, 8 — v/rm) = Lm 3 max (0, rm — v) 
v=0 v=0 


> Urn S, (Orn) — v) 


OSvS[Orm] 
[67m|([Orm] + 1) 
Dien 
all 
a 


If @ is a number > 10, then (a — 1)?/2 > a’/3. Consequently, since 
Grm > 10, we have by a trivial computation 


[07m]?/2%m = (O%m)* 3m = Or m/3 . 
Thus in this case, ind(F) = 6’rm/3 — 1d and 


a 


e< + inary +tta< 
Ya l Vm I 


™ 


ind (F’) + 46, 


since 7m 2 10 and 1 < 7m +1. 


On the other hand, if 6rm = 1 we have 
l= l= 
S* max (0, 0-— »/rn) = 50 — virm) = 10/2 . 
yv=0 v=0 


From this our assertion is again obvious, and this concludes the 
proof. 


§9. Proof of Proposition 2 


We come to the proof of Proposition 2. We shall in fact prove 
a slightly stronger version of it. If AK is a field with a proper set 
of absolute values Mx satisfying the product formula with multi- 
plicities N, = 1, and F(X, --:, Xm)€ K[X] is a polynomial, then we 


define as usual 
HF)= Jf |Fle*= IL llF lle. 


vEUR vEMK 
Proposition 4. Let K be a field of characteristic 0, with a proper 
set of absolute values Mx satisfying the product formula with multi- 
plicities N, 21. Let S be a finite subset of Mx containing the archime- 


dean absolute values, and let N= >) Nv. Let 


ves 


(21) =o tile : 
Let r;(j =1,-+-,m) be integers satisfying 
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(4) Or, > 2, °+*, Orm > 10. 

Let G(X, -+++,Xm) be a@ non-zero polynomial in K(X|\ of degree 
<r; in X; and let Bj =1,---,m) be elements of K with H(8i)= 
H;. Assume that: 


(22) Nilog4+4m S é log Hy 
(23) r, log H, S 7; log H; 
(24) HG) ae 
Then 


indvgy Ag hev eee 


Remark. What we want, of course, is that if 6 is small, the 
coefficients of G are small, and the 7; are é-decreasing, then the 
index of G is small. The hypotheses are made precise in order to 
load the induction. 


Proof: We prove Proposition 4 by induction on m. Suppose first 
m=1. Write G(X) =(X — $)F(X) and suppose degGsr. By 
Gauss’ lemma we have 


H((X — 6) H(F) s H(@)4 . 
For each absolute value in Mg, it is clear that 
Css FENN I ep) 


and hence H( 8)’ < H[{(X — £)']. From this and the fact H(F)2=1 
we get H(8)’ < H(G)4"”, whence 


e log H(8) < log H(G) + rNlog4. 
By hypothesis and conditions (24) and (22) we get 
es Nér+rd 
and consequently the index ind,,s(G) = e/r satisfies 
ers hN+1) S60”, 


if we use (21). Thus the case m =1 is proved. 

Let m>1. We use Lemma 1 and get a polynomial F= UV as 
in this lemma. The estimate for its coefficients obtained in Lem- 
ma 2 can now be transformed as follows, using (4): 


| Fle 2e 2 oF Gy 
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replacing (7; + 1) by 2"i and /! by /' < 2". Using (22) and (24) this 
yields 


y= BA HGy< yer | 


We also know that the degree of F in X; is < /r;. 

Since F = UV and the variables are separated, one can verify 
immediately that H(F) = H(U)H(V), and hence we get the same 
estimate for H(U) and H(V) that we have just obtained for H(F), 
because the height is always 21. Both U and V have degrees 
<r; in X;. In (24) we can replace G by U or V and 6 by 26, so 
that it remains valid (using (23) for V). In (4) we can also replace 
6 by 26 and 7; by ir; so that it remains valid. Thus the 
induction hypothesis is satisfied for each one of U, V with Jy; and 
20, 

Using Property 1 of the index, we get by induction 


IND m1) Bom—»(U) S (20)" "(20)" 
iMG ryp AV) S (20)(28)” 


The index for V is certainly at most equal to the index for U. 
The index of F at Bim) at Bim relative to /7im) is the sum of these 
two. Using Lemma 3 and the fact that 46 < 4(26)’” we get 


6° or 6S (4-3-V 2 )(20)"-1(20) 1/2)" 


from which we get 


tm Bm 


= 20yaiZoee™ 
thereby proving our proposition, and Roth’s theorem. 
§10. A geometric formulation of Roth’s theorem 


Throughout this section, K is a field satisfying the hypotheses of 
Theorem 1, so that this theorem is valid for it. 

We reformulate Theorem 1 slightly, as an intermediate step to 
the final statement of Theorem 2 below. 

Let G(Y) be a polynomial in K[Y](one variable) and assume that 
the multiplicity of its roots is at most r for some integer r>0. Let 
S be a finite subset of Mz. Let C >0beanumber, andx >2. Then 
the solutions B in K of the inequality 


‘ Cc 
Tdint (1, || GB) ||.) S Hp 
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have bounded height. 
To prove this, we may assume that G has leading coefficient 1, 


and say 
d * 
G(Y) = J] (Y — ai)* 
=] 


is a factorization in K. We extend our absolute values of S to K 
in any way. The expression on the left-hand side of our inequality 
is greater or equal to 


d 

DL Tf mt, Ne Seal 
vES 7=1 

which is itself greater or equal to the expression obtained by 

replacing e; by r for each i. Now we are in the situation of Theo- 

rem 1, taking into account the remark (iv) following it, namely, 

the solutions 8 of the inequality 


d 
Il If inf, 1@—a lly se 
vES i=1 (B) 
have bounded height, hence so do the solutions of our original in- 
equality. 

We observe that Theorem 1 is in fact equivalent to this theorem, 
for if a is algebraic over K, we take for G(Y) its irreducible equa- 
tion over K. Its roots occur with multiplicity 1, and if 8 appro- 
ximates a, then it stays away from the other roots, at a distance 
greater than some fixed constant, once it is sufficiently close to a. 
We can do the same for each a,, and thus we see that the impli- 
cation is trivial. 


THEOREM 2. Let W be a complete non-singular curve defined over 
K. Let z and y be two non-constant functions in K(W), and let r 
be the largest of the multiplicities of the zeros of z. Assume that y 
is defined at all the zeros and poles of z and gives an injective map- 
ping of this set of zeros into K. Let « be a number >2 and let 
C>0. Then the points Q¢ Wx which are not poles or zeros of z 
and are such that 


: G 
ML oyhGe || 2(Q) ll) = HO@Qy™ 


have bounded height Hy. 
Proof: Without loss of generality, we may assume that YOU 
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K(z, y)(otherwise, replace W by a non-singular model of K(z, y)). 
We may also assume that the absolute values 


| W1Q) |e 


are bounded for all ve S. Indeed, if there is an infinite sequence 
of points @ whose height H,(Q) = H(»(Q)) tends to infinity and 
which satisfies the above inequality, but with | y(Q)|, unbounded 
for some véS, we then consider a non-singular linear transforma- 
tion on y, say y’=(ay + b)/(cy + d), with coefficients in K, which has 
the same property as y in the statement of the theorem, but such 
that | y’(Q) |, is bounded for all ve S, and a subsequence of points 
Q. H, is then equivalent to H, (trivial by direct verification, or 
use Property 3 of heights). If S has only one absolute value v, then 
we can actually take 1/y instead of y. 

Let @ be the set of zeros of z. -Since y has no pole in 9, it is 
integral over the local ring 0 of the point z=0 in the function 
field A(z). Let F(Y) be its irreducible equation over 9. Then 


F(Y) =G(Y) (mod 2) 


where G(Y) is a polynomial with coefficients in K with leading 
coefficient 1 and mod z means modulo the maximal ideal of 0 ge- 
nerated by z. 

By hypothesis, y induces an injection Q — y(Q) of Q into K. The 
multiplicity of a root of G(Y) is thus < the multiplicity of a point 
on W in the inverse image of the point z = 0, since it is the mul- 
tiplicity of a zero of z. (One can see this formally for instance as 
follows: Let (y") be an affine generic point of w over K all of 
whose coordinates are integral over 0. If (y™,--+:,y) is a com- 
plete set of conjugates of (y"’) over K(z), then the cycle on W 
which is the inverse image of z=0 consists of a specialization 
(7, ++, 9'™) of Cy, ++,” ) over z->0. The conjugates of y 
correspond to the conjugates (y”’,---,y'”), and one can then use 
IAG, Theorem 2 of Chapter I, §4 applied to the polynomial F(Y). 

We can write 

EY) =GaepcAc, Y) 
where A(z, Y) is a polynomial in Y with coefficients in ». Since 


A(O, Y) is defined, so is A(z(Q), Y) for small values of || 2(Q) ||., 
which is all that we need to consider. Thus the values 


I! AQ), ¥Q)) IHo 
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remain bounded since we may assume that || y(Q)||, remains bounded 
for all veS. Since F(y)=0, we get an estimate for G(y(Q)), 
namely, 


Il G(y@)) |lv S C7 || 2(Q) Ilo 


for some constant C’ and all ve S. Taking the usual product, we 
get 


[I inf (1, || G(x(Q)) ||.) S$ C” inf 1, |] 2(@) Ile) 
2 a 
~~ ACW@))" 


which leaves us in the situation discussed at the beginning of this 
section, and concludes the proof. 

Actually, it will be even more convenient to make use of the 
projection property of Chapter IV,§4. Of course, in proving these 
properties of heights, we worked with a ground field F and abso- 
lute values which were well behaved in order to state our theorems 
for the absolute heights. Here, it does not matter since we shall 
work over a fixed field K (we incorporate the multiplicities N, di- 
rectly into the data). Properties of heights clearly remain valid 
under our present set up. Thus from the projection property, we 
get: 


TueoreM 3. Let W be a projective non-singular curve defined over 
K. Let & be a non-constant function in K(W) and let r be the lar- 
xest of the multiplicities of the poles of ». Let « be a number > 2, 
C a number >0. Let S be a finite subset of Mg. Then the points 
Qe Wx which are not among the poles of $, and are such that 


I sup (1, || $(Q) ||-) 2 C- HQ)" 


have bounded height. 

Proof: We let ® be the set of zeros and poles of ¢. We make 
a projection to get a function y so that we can apply the preceding 
theorem. We have also found it convenient to invert the formula, 
taking z = 1/¢. 

In the applications, we could either apply the formula directly, 
or work with only one absolute value in S. For simplicity, we 
shall follow this second alternative in the next chapter. 
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Historical Note: 


We refer to Schneider’s book for a complete historical note on 
the Liouville-Thue-Siegel-Mahler-Dyson-Schneider-Roth theorem. 
Mahler’s contribution was to have included several primes (i.e. 
taking the product). 

The proof follows that given by Roth without any essential change. 
A deeper understanding of the situation would come from three 
direction: 

(1) Making the result quantitative. For instance, letting «= 
2+ € and finding a bound for the height of 8 in terms of & as 
&— — 0(cf. [8]). Another possibility: Allowing C/H(§)* on the right- 
hand side of the approximation condition and finding asymptotic 
bounds for H(f8) in terms of C> o., 

(2) Dealing with simultaneous approximations. Here, it seems 
to be generally conjectured that the following statement should be 


true. Let ai, -+-,a@, be algebraic over K, and assume that 1, a, :--, 
a, are linearly independent over K. Then the height of points 
(1, B:,--:, Bx) = 8 with 8;¢ K such that the simultaneous conditions 
i| 
ee Bal Z 
(8) 


hold is bounded if « >1+1/n. (Over the rationals, one puts all 
the 8; over a common denominator, with relatively prime numera- 
tors, and one can therefore replace A(f) by this denominator.) 

(3) Generalizing the geometric formulation to, say, projective 
non-singular varieties V. The condition « > 2 should be replaced 
by a similar one (instead of 2, perhaps 1 + ” where m = dim V), and 
the multiplicity 7 by a multiplicity which must be a biholomorphic 
invariant, so that it remains the same in unramified coverings. This 
is essential if one is to apply the theorem to isogenies of abelian 
varieties, (cf. the comments at the end of Chapter VII). In this 
connection the case of linear functions has been investigated, and 
and we refer the reader to a paper of Schmidt for asymptotic 
results concerning approximations to almost all numbers. (W. 
Schmidt, A metrical theorem in diophantine approximation, Canadian 
Journal of Mathematics Vol. 12 (1960 pp. 619-631, Theorem 2.) 


CHAPTER VII 
Siegel’s Theorem 


If C is an affine curve defined over a ring R finitely generated 
over Z, and if its genus is 21, then C has only a finite number 
of points in R. This is the central result of the chapter. We 
shall also give a relative formulation of it for a curve defined over 
a ring which is a finitely generated algebra over an arbitrary field 
k of characteristic 0. In that case, the presence of infinitely many 
points in R implies that the curve actually comes from a curve 
defined over the constant field and that its points are of a special 
nature (excluding possibly a finite number). 

In $1, we put together Roth’s theorem and the weak Mordell- 
Weil theorem to show that integral points are of bounded height. 
It is actually more convenient (although not really more general) 
to deal with a non-constant function ¢ whose values ¢(P) lie in a 
ring of the above type. 

In §2, we then consider the two standard examples and prove 
the statements of absolute and relative finiteness. 

To deal with curves of genus 0, it will be convenient to use the 
decomposition theorem (although in this special case, it would need 
less machinery to prove it). Suppose we deal with a projective 
non-singular variety defined over a field K with a proper set of 
absolute values which we assume to be discrete fora moment. For 
instance, we may take a number field together with the set of all 
its proper non-archimedean absolute values, or a subset of it, ex- 
cluding only a finite number of them. Then, given an element of 
K, we have its Mx-divisor or ideal (the same thing in this case). 
Given a projective non-singular variety V defined over K, let ge K(V) 
and suppose that we write the divisor of g in terms of its decom- 
position into prime rational divisors of V over K: 


(g) = maT 


where the 7, are distinct and the m, are integers. If P is any 
point of Vx which does not lie in the support of the zeros or poles 
of vy, then g(P) is an element of K. We shall describe its ideal 
decomposition in terms of the 7,. In fact, with each prime divisor 
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T of V over K and each point P, we shall associate an integral 
M,-ideal a7(P) such that 


(p(P)) = maar (P) 


up to a fixed number of components depending only on ¢ and not 
on P. Furthermore, if V is of dimension 1, then the different ar, 
are “relatively prime” up to a fixed ideal. 

In developing the theory, we do not restrict ourselves to discrete 
absolute values, and the hypothesis that V is projective non-singular 
is used only to guarantee the existence of the theory. Except for 
that, we can proceed axiomatically and assume only that V is normal 
and complete. 

We shall apply the decomposition to curves of genus 0 to classify 
those which have infinitely many integral points. If gy is a non- 
constant function on V, V is assumed to be of dimension 1 as above, 
and if R is a subring of K all of whose elements are integral for 
the valuations in Mx, then g(P)¢R means that for each pole T of 
o, our ideal az(P) lies between two fixed bounds. This will imply 
that the function ¢ is of a very particular type, which will be 
determined in § 4. 


§1. Height of integral points 


Throughout this section, we shall deal with a field F with a proper 
set of absolute values My satisfying the product formula. Let K be 
a finite extension of F, with its set of absolute values Mg. Let ve Mg. 
We shall say that K satisfies Roth’s theorem for v if the following 
assertion is true: 

Given a projective non-singular curve U defined over K, let b€ K(U) 
be a non-constant function, and let r be the largest of the orders of 
the poles of ». Let « be a number >2, and c>0. Then the points 
Qe Ux which are not poles of $, and are such that 


| PQ) |. 2 cH(Q)” 


have bounded height. 
Here, and in the rest of this section, we have put H = Hy. 
Using the weak Mordell-Weil theorem, our purpose is to improve 
this statement for curves of genus =1. We shall prove: 


TueoreM 1, Let K be as above, satisfying Roth’s theorem for one 
of the absolute valuesvé Mx. Let C be a complete non-singular curve 
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of genus 21 defined over K, let J be its Jacobian, also defined over 
K, let m be an integer >0, prime to the characteristic such that 
JxlmJg is finite. Let y be a non-constant function in K(C), and let 
p, c be numbers >0. Then the height of points P in Cz which are 
not poles of ~ and such that 


| @P) |» 2 cH( PP 


is bounded. 

In view of Property 4 of heights, we observe that if a set of 
points has bounded height in one projective embedding, then it has 
bounded height in every projective embedding. Therefore we need 
not specify the embedding in statements such as the above. 

If C is a complete non-singular curve defined over K, of genus 
21, and with a rational point, then its Jacobian J is also defined 
over K, and we may assume that C is contained in it under a 
canonical mapping. We shall take J with a fixed projective em- 
bedding, and we shall take on C the induced embedding. The 
height H = Hg is taken relative to this embedding. 


Proposition 1. Let m be an integer >0, unequal to the character- 
istic of K, and assume that Jx/mJx 1s finite. Let © be an infinite 
set of rational points of C in K. Then there exists an unramified 
covering w:U—C defined over K, an infinite set of rational points 
6’ of Uin K, such that w induces an injection of &' into S, and a 
projective embedding of U over K such that How is quasi-equivalent 
to H™’. (Of course, in How the H refers to the height on C, while 
in H™ it refers to the height on U.) 

Proof: Let a:,++-+,@, be representatives of cosets of Jx/mJx. 
Infinitely many Peé© lie in the same coset, and so there exists one 
point, say a, and infinitely many points Q¢ Jz such that m@ + a 
lies in ©. We let ©’ be this infinite set of points @. The covering 
wo:jJ— J given by wu = mu +a, is unramified, and its restriction 
U to C also gives an unramified covering of C, which is an irre- 
ducible covering of the same degree as w. The inverse image of 
a point in C lies in U. Thus ©’ is actually a subset of Ux. Re- 
stricting one’s attention to a subset of 6’ guarantees that w induces 
an injection on this subset. 

To prove the relation concerning the height, we may work on 
the Jacobian itself since we take U in the projective embedding 
induced by that of j. If X is a hyperplane section of /, then 
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wo '(X) = (md) X-a, = m°X 


where = is algebraic equivalence. But H,,-1,,, ~ Hx°w by property 
2 of heights (which is trivial), and H.-1,,, is quasi-equivalent to 
Hy Hz’ by Property 4. Our proposition is therefore proved. 

Although it is not difficult to prove that the restriction of our 
covering w to C is of the same degree as w (cf. [15] or [33]) it is 
actually irrelevant here, since we could just as well use one of the 
components of this restriction that contains infinitely many points 
of S’. Such a component must then be defined over K. 

We note that 9(P) = ¢(#(Q)). Let % be the function gow on U. 
Let « be a number >2, and let m be large enough so that m’p > kr. 
Then for a suitable constant c > 0, our inequality becomes 


| $(Q) |» 2 cH(Q)" 


and we can apply Roth’s theorem, because the orders of the poles 
of ¢ on U are the same as those of ¢ on C, since our covering is 
unramified. This concludes the proof of Theorem 1. 


TuHEeorEM 2. Let K, C, y be as in Theorem 1. Let S be a finite, 
non-empty subset of Mx, containing all the archimedean primes and 
assume that K satisfies Roth’s theorem for each ve S. Let R be a 
subring of K all of whose elements are integral for each valuation 
v€S, and let K be the subset of Cx consisting of those points P which 
are not poles of ~, and such that o(P)¢€R. Then the points of ® 
have bounded height. 

Proof: Assume the contrary, and let ®, be a subsequence of 
points of #® whose height tends to infinity. By assumption, we 
have |&|,<1 for v¢S and &€R. Hence for all PER, we get 


H(g(P)) = TI supti, | ¢(P) |)”. 
ves 
Let N=[K: F] and let s be the number of elements of S. Since 
N. S N, we have at most Ns terms in our product, of type 
sup(1, | e(P) |») . 
Consequently, for each Pek, there exists one v in S such that 
| o(P) |, = H(y(P))/** 


Hence there exists an infinite subset 3, of R, such that for some 
véS and all points Pe R, we have 
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le(P)l. 2 Hg(P))™. 


In view of the corollary to Property 4 of heights, we can compare 
H(g(P)) and H(P)). If d is the degree of C in its given projective 
embedding, and ¢ is viewed as a map into P’, then there is a number 
c, > 0 depending on e such that for P in Cg we have 


BP) = co ieP)). 


Combining this with our previous inequality, we see that there is 
a number 9 > 0 such that for some veS and all PeR, we have 
for suitable c, > 0: 


| p(P) |b = eo H(P)P 


and we can apply Theorem 1 to get our contradiction. 

The special cases of interest of Theorem 2 are number fields and 
furiction fields. In the case of number fields, we take for R a 
finitely generated ring over Z, and then we conclude that the set 
of points P such that ¢(P) lies in R, is actually finite. This will 
be generalized to rings of finite type over Z, including transcen- 
dental elements in the next section. 

In the case of function fields, we shall consider a function field 
K over a constant field k of characteristic 0. The set of absolute 
values is that of Example 2, Chapter II, §1. All the conditions 
of Theorem 2 are satisfied, except that /x/mJx need not be finite. 
However, in view of the fact that the height is geometric, i.e. 
does not depend on the constant field (Chapter III, Proposition 6, 
§ 3), we may go over to the function field Kk over k, i.e. we may 
without loss of generality assume that & is algebraically closed. 
In that case, we can apply Theorem 2 of Chapter VI, taking into 
account that B, (and hence rB,;) is divisible, and thus that Jx/mJ/x 
is indeed finite. 

In the next section, we shall give a more precise description of 
our sets of bounded height and sets of integral points. 


§2. Finiteness theorems 


If K is a function field over a constant field k we shall pick a 
model as in Example 2 of Chapter II to compute heights. If C is 
a complete non-singular curve defined over K, and a subset of 
points of Cx has bounded height in some projective embedding, 
then we recall that it has bounded height in every projective em- 
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bedding (say by Property 4 of heights). 


Proposition 2. Let K be a function field over a constant field k 
of characteristic 0. Let C be a complete non-singular curve of genus 
>1 defined over K. Let X% be an infinite subset of Cr of bounded 
height. Regard C as embedded in its Jacobian J over K, and let 
(B, rt) be a K/k-trace of J. Then t is an isomorphism. The points 
of K lie in a finite number of cosets of tBy. If infinitely many of them 
lie in one coset, and so are of type a+ tb where a is some point of 
Jx and b ranges over an infinite subset of By, then Cy = *(C-«) 
is defined over k, and t induces an tsomorphism of Cy onto C-a. 

Proof: We may assume that / and B are embedded in projective 
space. We see from the Corollary of Theorem 6 of Chapter VI, 
§5 that the points of ® lie in a finite number of cosets of rB,. 
Assume that infinitely many lie in the coset @a+tB,. We know 
that rc establishes an isomorphism between Band 7tB by AV, Corol- 
lary 2 of Theorem 9, Chapter VIII. Since infinitely many points 
of tB, lie in C_a, it follows that their K-closure in rB or J is 
precisely C_.. Put Cy) =t ‘(C_a). Then C, is a curve contained in 
B, and tr induces an isomorphism t) of C, onto C-.. Furthermore, 
C, contains infinitely many points b of B rational over k. It is 
then a trivial matter to conclude that C, is defined over k, because 
these infinitely many points are both k- and A-dense in Cy. Since 
tB contains a translation of C, it follows that rB = J is the Jacobian, 
i.e. tT iS an isomorphism. 

Remark. \f we do not assume characteristic 0 in the preceding 
proposition, then rt is merely bijective, and C, may be defined over 
a purely inseparable extension of &. 


Corouttary. Let K, k be as above. Let C be a complete non-singular 
curve of genus =2 defined over K, and let R be an infinite subset 
of Cx consisting of points of bounded height. Then there exists a 
curve Cy defined over k and a birational transformation T:C,—>C 
defined over K such that all but a finite number of points of ® are 
images under T of rational points of Cy in k. 

Proof: Since the genus is 22, the curve cannot be equal to any 
translation of itself in its Jacobian. (If it were, so would the 
divisor 9, and it isn’t, even up to linear equivalence, by AV 
Theorem 3 of Chapter VI, §3.) Hence there can only be one coset 
having infinitely many points of C, and we apply the proposition. 
Combining our corollary with the results obtained in the previous 
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section, we get: 


TueoreM 3. Let K be a function field over a constant field k of 
characteristic 0. Let R be a subring of K of finite type over k. Let 
C be a complete non-singular curve of genus =1 defined over K, and 
y a non-constant function on C, also defined over K. Let ® be the 
subset of Cx consisting of those points P such that o(P)eR. If ® 
ts infinite, then there exists a curve Cy defined over k and a birational 
transformation T:C,—C defined over Kk. If the genus is =2, then 
all but a finite number of points of R are images under T of points 
of Cor. If the genus is 1, then the points of lie in a finite number 
of cosets of T(Cox). 

Our next theorem gives the absolute result. 


TueoreM 4. Let K be a field of finite type over Q, and R a subring 
of K of finite type over Z. Let C be a projective non-singular curve 
of genus =1 defined over K, and let » be a non-constant function 
in K(C). Then there is only a finite number of points PE Cx which 
are not poles of » and such that o(P)€ R. 

Proof: Vf R is algebraic over Z, this is simply Theorem 2 of 
§1 together with the finiteness of points of bounded height in a 
given number field. We shall reduce the general case to this one 
by a specialization argument, made possible by the previous theorem. 

Suppose that R is infinite. Let Rk be the algebraic closure of Q 
in K. Then by Theorem 3, C is birationally equivalent over K to 
a curve C, defined over &. If the genus of C is 1, we restrict our 
attention to an infinite subset of points of R which lie in the same 
coset of 7(Co,). Then without loss of generality, we may assume 
C=C), and that we have infinitely many points of C in K such 
that g(P)¢ R, where ¢ is a function defined over K. We shall now 
prove our theorem by induction on the dimension of K over Q. 

Let F be a subfield of K containing k such that the dimension 
of K over F is 1. There exists a discrete valuation ring » of K 
containing F and R whose residue class field E = 0/m is finite over 
F and such that the reduction ¢’ of ¢ mod m is a non-constant 
function o’: C—>P' (of the same degree as ¢). For any point Q 
of Cx, we get a specialized point Q’ in C, and ¢'(Q’) = ¢(Q)’' using 
the compatibility of intersections and reductions, i.e. formally, using 
the graphs: 


[To-(Q x P|’ =1e-Q x P’) 
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the left-hand side being Q x ¢(Q) and the right-hand side being 
Q’ x ¢'(Q’). This yields infinitely many points Q’ of Cg such that 
¢'(Q’) lies in the ring R’, the image of R in the homomorphism 
p—o/m. Since £ is of finite type over Q, of dimension one less 
than that of K, and since R’ is still of finite type over Z, this 
concludes the proof. 


§3. Theorem of decomposition 


Throughout this section, K is a field with a proper set of absolute 
values Mz. V is a complete, normal variety defined over K. All 
points, functions, and divisors on V will be assumed to be rational 
over K. All places to be considered will be assumed to be K-valued 
places of K(V) over K. We use v to denote absolute values in Mg 
and w to denote absolute values in Mz. 

If y is a function on V (and thus in K(V) according to our con- 
ventions), and z a place, then either z(y) is c% in which case we 
put 


| 7(@) |w == 00 we Mrz 


or z(y) is finite and | z(¢) |. has the usual meaning. We agree that 
co is greater than any real number, and that as usual, 1/0 = o 
while 1/o = 0. 

If P is a point which does not lie in the divisor of poles of 9g, 
then for any place zp which induces P on V, ze(¢y) is the same 
and is equal to y(P). If P does not lie in the divisor of zeros of 
y, then zp(y) = © if P lies in the divisor of poles, and is finite 
+0 if P lies neither in the divisor of zeros or poles of 9, i.e. if 
P € supp(v). These criteria will be applied constantly in the sequel. 
(Cf. IAG, Proposition 3 of Chapter VI, §1.) 

By a distribution we shall mean a function d(z,w) of places z 
and absolute values w which can be expressed in the form 


d(x, w) = sup, inf; | 2(¢:;) |w 


where ¢,,(@=1,---,7 and j =1,---,s) are functions on V. We 
then write D = dy = dy,, and we say that the distribution is attached 
to the set 9.;, or ¢. 
It is clear that if d(z, w) is a distribution attached to the set 9;;, 
then »(z, w)~* is also a distribution, attached to the set $j; = 1/9;;. 
If d(z, w) and »/(z, w) are two distributions, attached to the sets 
gi; and ¢y, respectively, and if their product is defined (i.e. in no 
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case is one of the factors 0 while the other is 0) then the product 
d(z, w)d'(z, w) is also a distribution, attached to the set wiy,j. = 
isfy, 1.e. the product is equal to 


supi,v inf;,4 | 7Oishvu) lw . 


This assertion is an immediate consequence of the definitions. 

A distribution 0 is said to be rational over a field E (algebraic 
over K) if d= De,, for some set of functions ¢;; rational over E. 

Let d be a distribution and P a point of V. We shall say that 
d is defined at P if it satisfies the following conditions: 

(1) D(z, w) is the same for all places z inducing P on V, and is 
then denoted by d(P, w). 

(ii) 0d(z,w) # co for any w, and if dis rational over a finite extension 
E of K, and P is also rational over E, then 6(z, w) depends only 
on the absolute value induced by w on E, and as such is an M,- 
divisor. 

The next proposition gives an important criterion for a distribu- 
tion to be defined at a point. 


Proposition 3. Let T be a divisor on V and assume that we can 
write 
(Gy) = Xi — Ye + T 


where X;, Y; are divisors 20, and the Y; have no point in common. 
Then the distribution dy is defined at any point P € supp(T), and 
either P is a common point of all the Xi, in which case dy4(P, w) = 0 
for all w, or the X; have no point in common, and then 


dP, w) = supy inf j | Giri P) |w #0, 00 


where i’ ranges over the indices such that P € supp(Xi+) and j' ranges 
over the indices such that P € supp(Y;:). 
Proof: Assume P € supp(T). Given an index 7, we have two 


cases: 
Case 1. P € supp(X;). Then for some index j’ we know that 


P¢supp(Y;-) and hence 9;;(P) is defined. Thus 
inf; | wp(Gis) lw = inf | gis CP) |w - 


Case 2. Pesupp(X;). Then inf; | z,(¢i;)|~=0. Taking the sup 
over i, our assertions are now clear. 


Corottary. Let the notation be as in the previous proposition and 
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assume in addition that T is positive. If Pésupp(T), then dg ts 
defined at P and equal to 0. 

We recall that if c is an Mx-divisor, we put c(w) = c(v) for any 
we Mg extending some ve Mg. 

Let b and »’ be two distributions. They will be said to be eguiva- 
lent (and we write )~ bd’) on a set of points U if they are both 
defined on U, and if there exist two Mgx-divisors c, and cz, such 
that 


c(w)d(P, w) S d'(P, w) S c2(w)do(P, w) 


for all Pe U and all we Mz. We shall give a criterion for two 
distributions to be equivalent. For this purpose, we need a lemma. 


Lemma 1. Let T be a divisor on V, T=0, and let i; be func- 
tions such that 


(gi) =Xi— Y;+ L 


where the X;, Y; are positive divisors, the X; have no point in common, 
and neither do the Y;. Then there exists an Mx-divisor c such that 
for all points P we have for all we Mg 


Do(P, w) S c(w). 
Proof: It will suffice to prove that for each 2, 
inf; | z(~i;) |w S clw) , 
and this relation is equivalent to 
sup; | z(1/¢i;) |w = c(w)” . 
On the other hand, we have for each ij, 
(V/gis) = Y¥; -Xi-T. 


Since T 20, our functions 1/y;; have the property of Lemma 1, 
Chapter IV, §2 and we are finished. 


Corottary. Jf T=0 in the lemma, then dg is equivalent to 1. 

Let T be a divisor on V, and assume that we can find functions 
yi; and positive divisors X:, Y; such that the X; have no point in 
common, the Y; have no point in common, and 


(gi) = Xi—- Y;4+T. 


We shall then say that we can attach a distribution to T. If dyn, 
X;, Yu is another set of such functions and divisors, then bd, and 
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Dy are equivalent on V—supp(T). Indeed, each one is #0, #0 
and d,/dy, is the distribution >. where oiy,jv = 9ij/Puv. Thus 


(Wi, iv) = X;+ Ge = vs _— ye 


(we may obviously assume that none of the ¢;; or dy, is the function 
0). We can therefore apply the corollary of Lemma 1 to conclude 
that d. is equivalent to 1, and hence that by and by are equivalent. 

We are interested in distributions only up to equivalence. If we 
can attach a distribution to a divisor T, we shall denote by d7 any 
distribution in the equivalence class of distributions dy, obtained in 
the above manner. 

We shall now list systematically the properties of distributions 
associated with divisors. 


Property 1. Jf T=0 and we can attach a distribution to T, then 
we can take d7 S1, and dr is defined and >0 on the complement of 
supp(T). It is defined and equal to 0 on supp(T). 

This follows immediately from Lemma 2, from which we see 
that inf(d,, 1) is equivalent to dy. 


Property 2. If T and T' are two divisors to which we can attach 
distributions Dr and dr respectively, then we can attach a distribution 
to T+ T' and 


Orin ~ Drdr- 


on the complement of supp(T) Vv supp(T'). 

This assertion is obvious, simply by taking the products of two 
sets of functions ¢;; and ¢y, giving rise to the distributions 07 and 
Dr respectively. 


Property 3. Let f be a function on V. Let U be the open set 
where f is defined. Then we have a distribution d; defined on U, and 
such that for each Pe U, we have 

dP, w) = |f(P) Iw. 
This is trivial, putting all 9; =f. 
Property 4. Jf T, T' are two divisors to which we can attach distri- 


butions dr and dz respectively, and if they are linearly equivalent, 
i.e. if there is a function f such that 


Gal =F 
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then d+ is equivalent to dz/dr on the complement of supp(T)U supp(T'). 
Indeed, if y;; are used to define by as above, then we can clearly 
use the functions fy,, to define dz. 
Property 4 is known as the decomposition theorem. 


Property 5. If T,,--:, Tm are positive divisors on V to which we 
can attach distributions, then supsdr, is a distribution. If the T) 
have no point in common, then sup,dr, 1s equivalent to 1. 

Suppose that we have for each index 4, 


@:,.3) = Xs = A + TS 


as usual. Let the pairs (4, 7,) form a new set of indices v, and 
the pairs (4,j,) form a new set of indices uw. Then 


supadr,(z, Ww) = Supa SUPi, inf;, | (Pin. 5a) i 
supa sup,, infa inf;, | 7(Pvu) |w 
supy inf, | 7(Pyu) |w 


where $y, = %:,,;,. This shows that our supa dy, is a distribution. 
If the 7, have no point in common, then by the Corollary of Lemma 
1, this distribution is equivalent to 1. Indeed, 

(fy) = XY — Yu 
where Xj = X;, + T, and Y, = Y;, so our assumptions imply that 
the corollary applies. 

Finally, we shall give a criterion for the possibility of attaching 
distributions to divisors. If T is a divisor on V, and K; its small- 
est field of rationality containing K, we shall say that we can 
attach a distribution to T rationally over Kr if the functions 9;; 


and the divisors X;:, Y; of Proposition 3 can be selected rational 
over Kr. 


IV 


TueoreM 5. Let V be a projective non-singular variety defined over 
the field K having a proper set of absolute values Mg. Then for every 
divisor T on V rational over K, one can attach a distribution ratio- 
nally over Kr. 

Proof: We just use Lemma 2 of Chapter IV, §3 so that T+ X ~ 
Y where X, Y are both ample. We can choose them both rational 
over Kr. Then we simply pick hyperplanes X;~ X and X;~ Y 
rational over Kr and having the required properties (i.e. the X; 
without points in common, and the Y; also). (Of course the X; 
and Y; are hyperplanes in different projective embeddings.) 
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As mentioned previously, the main application of the theorem of 
decomposition will be to curves. If we translate Properties 4 and 
5 to a context where all the absolute values of Mx are discrete, 
we get: 


TueoreM 6. Let K be a field with a proper set Mx of absolute 
values, all of them discrete. Let V be a complete non-singular curve 
defined over K. To each prime rational divisor T of V over K, and 
to each point PE Vx, one can associate an integral Mx-ideal a7(P) 
having the following property: If is a function in K(V), and 


(gy) = m1 


is its divisor where the T, are distinct prime divisors of V rational 
over K, then there exist two Myx-ideals 6, and b,, depending only on 
gy, such that for any P€ Vg which is not a zero or pole of ~, we 
have 


b, + ay mar, (P) <= (o(P)) Sh + De mar,(P) ' 


If T and T' are distinct prime divisors, then there exists an integral 
Myx-ideal 6 such that for all PE Vx we have 


0S inflar(P),ar(P)] Sb. 


It is convenient to reformulate our notion of equivalence in the 
special case of Theorem 6. Therefore let K and V be as in this 
theorem. Suppose that we have a mapping a from Vx to the Mx- 
ideals, so for each Pe Vg, a(P) is an Mg-ideal. Two such maps 
a,a’ are called equivalent if there exist two Mx-ideals 6,, 6, such 
that for all Pe Ve we have 


b + a(P)sa(P)sh+a(P). 


In Theorem 6, we see that our maps @7 preserve the usual opera- 
tions of divisors and ideals, i.e. the sum, sup, and inf, up to equiva- 
lence. 

We observe finally that the theory of distributions can be de- 
veloped without a “constant field” as the beginnings of it are done in 
[40]. One could also formulate the decomposition theorem in that 
case, i.e. essentially for a variety scheme all of whose fibers are 
also varieties and for absolute values on the function fields of these 
fibers. We leave this as an exercise to the reader. 
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§4. Curves of genus 0 


As a typical example of the arguments which follow, let us 
consider a curve 


ax+by=1 


with coefficients a,b #0 in a field K of characteristic 0. Let I be 
a finitely generated multiplicative subgroup of K*. Then there is 
only a finite number of solutions of the above equation with x, yel. 

Proof; Let » be an integer 23. If there were infinitely many, 
then there would be infinitely many in one coset of [’/I”, which 
we could write in the form 


y= ae. and \ = Dip oe 
This would give infinitely many solutions in 7 x I of the equation 
aa,é" + bb,” =i) 


which has genus 21. We can view these solutions as lying in the 
ring finitely generated by I over Z, thereby contradicting Theorem 
4 of §2. 

By a linear torus (torus for short) we shall mean a group 
variety which is isomorphic to a product of multiplicative groups. 
We note that given a finitely generated subgroup of a commutative 
group variety, there always exists a field of finite type over the 
prime field over which all the points of this subgroup are 
rational. 


TueorEM 7. Let G be a torus in characteristic 0. Let T be a 
finitely generated subgroup of G, and let C be a curve contained in 
G. If CnT ts infinite, then C is the translation of a subtorus. 

Proof: Let K be a field of finite type over Q such that G is 
isomorphic to a product of multiplicative groups over K, and such 
that all points of 7 are rational over K. Let (m, ---, xn) be a generic 
point of C over K, each x; lying in the multiplicative group. In 
view of Theorem 4, C must have genus 0, and hence K(x, -+-+, x.) = 
K(t) for some parameter ¢t. Thus each x; is a rational function of 
t, say 


ait) . 


Let m be an integer 21. Infinitely many points of C must lie in 
some coset of /7//". Thus there exist elements a;¢ K* such that 
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if we put x; = a;€7", then the curve C’ whose generic point is (£,, ---, En) 
over possibly a finite extension of K has infinitely many points 
in I’, This implies that this curve also has genus 0. From this, 
we deduce that each function ¢,(t) has at most two singularities 
(zero or pole). Indeed, a curve defined by an equation 


E"=ag(t), aeK* 


where g(t) is a rational function of ¢ has genus =1 as soon as m 
is sufficiently large, and relatively prime to the order of the zeros 
and poles of ¢, provided that ¢ has at least 3 singularities. We see 
this from the genus formula 


2g' —2= —2m + Ser — 1) 


each ramification index being equal to m in this case. Thus the 
term 5) (er — 1) grows at least like 3(m — 1) as m tends to infinity, 
and the genus does become large. 

Choosing our parameter ¢ properly, we may write say for 7 = 1, 


x= Gites 


for some integer 7, # 0. We now contend that each x; can be written 
a;t"*. This will prove our theorem. 
Our covering C’ of C contains as an intermediate covering the 
curve defined by the equation 


Ba ee) 


which must be of genus 0. This implies that ¢"'y,(¢)*’ has at most 
two singularies, and hence clearly that ¢;(t) = a:t" for some integer 
vy; Our theorem is proved. 


TueoreM 8. Let K be a field of finite type over Q, and R a subring 
of finite type over Z. Let C be a complete non-singular curve of 
genus 0, defined over K, and let ¢ be a non-constant function in K(C). 
Suppose that there are infinitely many points of Cx such that 9(P) 
lies in R. Then ¢ has at most two distinct poles. 

Proof: Suppose ¢ has at least three distinct poles. We can select 
a model of C such that none of the poles is at infinity. This means 
that K(C) = K(t) for some transcendental element ¢ over K, and that 


g(t) = fOle® 


where f and g are polynomials in K[¢], g(t) has leading coefficient 
1 and at least three distinct roots a1, @2,a3;. After making a finite 
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extension of K, we may assume that these three roots are in K. 
Without loss of generality, we may assume that K is the quotient 
field of R (adjoin a finite number of elements to R) and that FR is 
integrally closed (Proposition 2 of Chapter H,§1). If there are 
infinitely many points P in Cx such that 9(P)¢R, then for such 
points P, the distribution aa,(P) as in Theorem 6 is equivalent to 
0 (additive notation). This means that the function (¢ — a;)/(t — aj), 
for 1 #j, which has no zero or pole except a; and a;, is such that 
for any value ?t’ of t in K, # a; or aj, the Mr-ideal of (t’ — a;)/(t’ —a;) 
lies between two fixed bounds. Excluding the primes occurring in 
a finite set (Proposition 5 of Chapter II, §4) and using Theorem 5 
of Chapter II, §4 we see that (¢t’ —a,)/(t' — a@;) lies in a finitely 
generated group. 
Now we have Siegel’s identity: 


As — Oy t—a, a3 — ay ae 
Thus from our assumptions, we get infinitely many points in a 
finitely generated multiplicative group on a curve of the type con- 
sidered at the beginning of this section, contradiction. 

Knowing that our function ¢ has only two poles, we can then 
describe it further. On the one hand, these two poles may be both 
finite, determined by a, @ where a is a quadratic irrationality over 
KK hus 


gt) = FHlg@) 


where g(t) is a power of (¢—a)(t— @), essentially a norm. One 
could use the decomposition theorem once more to analyze this 
situation. We leave it to the reader. If we make a quadratic 
extension of A in this case, or if all the roots of g are in K, then 
we can choose our parametrization of C by ¢ in such a way that 
one of the poles is at infinity, and thus ¢(¢) takes the form 


p(t) = fld)/t" 
where /(¢) is a polynomial and m is an integer =0. The decom- 
position theorem again shows us that the only possibility for integral 
points are the obvious ones. 
We could also analyze the situation first in the relative case, in 
order to reduce it afterwards to the absolute case or the case of 
number fields. Let us just state one more result. 
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TueorEM 9. Let K be a function field over the constant field k of 
characteristic 0, and let R be a finitely generated subalgebra of K 
over k. Let C be a complete non-singular curve of genus 0 defined 
over K, and ~ a non-constant function in K(C). If © has at least 
three distinct poles, then the points P of Cx such that o(P)€ R have 
bounded height. 

Proof: Just as in the preceding theorem, we end up with an 
equation ax + by =1 and solutions in R* (after possibly extend- 
ing R to a larger ring). We can then proceed as before in the 
reduction to curves of genus =1 to get what we want. 

We leave it to the reader to show that elements of bounded height 
in A* lie in a finite number of cosets of &*. Our theorem reduces 
the study of integral points on C to such points of bounded height. 


Historical Note: 


It was Siegel [37] who showed how one can put the Mordell-Weil 
theorem together with diophantine approximations (the Thue-Siegel 
theorem) in order to prove that a curve of genus =1 has only a 
finite number of integral points. Since at that time he did not 
have the strong form of Roth’s theorem he had to surmount con- 
siderable difficulties in the diophantine approximations because of 
the lack of uniformity. 

He actually worked over the ring of integers of a number field. 
Mahler [21] noted that by using his p-adic version of the diophantine 
approximations, one could allow a finite number of primes in the 
denominator, at any rate for curves of genus 1 over the rationals. 
He conjectured that the same result would hold for curves of genus 
=1 over number fields. 

Once the theory of the Jacobian was sufficiently developed and 
once Roth’s theorem was obtained, it was then natural to reconsider 
this question. The general version used here was presented in [18] 
following Siegel’s (and Mahler’s) method. The Jacobian replaces 
the theta functions, as usual, and the mechanism of the covering 
already used by Siegel appears here in its full formal clarity. It 
is striking to observe that in [15] I used the Jacobian in a formally 
analogous way to deal with the class field theory in function fields. 
In that case, Artin’s reciprocity law was reduced to a formal com- 
putation in the isogeny u— u” — xu of the Jacobian. In the present 
case, the heart of the proof is reduced to a formal computation of 
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heights in the isogeny u— mu + a. 

The decomposition theorem and the theory of distributions were 
created by Weil in his thesis [41], and we have followed the ex- 
position he gave in [40], taking into account our further axiomati- 
zation of it. Siegel used it to handle curves of genus 0, but in 
that case made a direct analysis of the functions, which have a 
natural factorization, not only into divisors. In the theorems con- 
cerning curves of genus 0 we also follow Siegel [37], transporting 
his proofs to our more general case. The fact that Siegel’s theorems 
for curves of genus 0 or 21 have a relative formulation was pointed 
out in [18]. An analysis of the classical proofs shows among other 
things that they must be separated into two parts: One, proving 
that certain sets of points have bounded height, and second showing 
what sets of bounded height look like. 

As Siegel already observed, the coset argument is formally the 
same when dealing with curves of genus =1 or subgroups of multi- 
plicative groups. The analogy between toruses and abelian varieties 
was again observed by Chabauty, who in two papers [5] and [6] 
considers the infinite intersections of a subvariety of a torus or an 
abelian variety with particular subgroups of finite type, namely 
subgroups of units and groups of rational points in a number field, 
respectively. Thus one is led to generalize and reformulate a con- 
jecture of Chabauty [5] in the following manner. 

Let G be a torus (resp. an abelian variety) in characteristic 0. 
Let V be a subvariety of G having an infinite intersection with a 
subgroup of finite type I of G. Then V contains a finite number of 
translations of group subvarieties of G which contain all but a finite 
number of points of Vol. 

This statement was proved in § 4 for the case when V is of dimen- 
sion 1 and G isa torus. One may ask whether it would not be valid also 
for a commutative group extension of an abelian variety by a torus. 

Let us consider the statement when G is an abelian variety A 
and V is a curve. It then generalizes a classical conjecture of 
Mordell [23] to the effect that a curve of genus =2 over the rationals 
has only a finite number of rational points. 

As for integral points, I would conjecture that Siegel’s theorem 
in its absolute and relative form should be valid for abelian varieties, 
namely: If A is an abelian variety in characteristic 0, defined over 
a field of finite type K, if U is an affine open subset of A, and R 
a finitely generated subring of K over Z, then U has only a finite 
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number of points in R. In the relative case where K is function 
field over a constant field k of characteristic 0, and R a finitely 
generated subalgebra over k, the conjecture states that integral 
points of U in R have bounded height, and thus lie ina finite number 
of cosets of tBy. 

In this connection, the Mordell conjecture becomes a conjecture 
in algebraic geometry, and it is worthwhile to make further com- 
ments on it here. Let k be as above, K = K(t) a function field 
over k, where ¢ is the generic point of a variety T, and let C=C, 
be a curve of genus 22, defined over K. Then C; can be viewed 
as a generic member of an algebraic family. The conjecture asserts 
that if C, has infinitely many rational points in k(t) (cross sections 
of the parameter variety T in the graph of the family), then C;, ts 
birationally equivalent over k(t) to a curve Cy defined over k, and all 
but a finite number of these points come from points of C, ink. In 
particular, the graph is birationally equivalent to a product. 

Evidence for this comes from the special case when C; = C, is 
already defined over k, and then one obtains a classical theorem 
of de Franchis, to the effect that given a variety V and a curve C 
of genus =2 (in characteristic 0), there exists only a finite number 
of generically surjective rational maps of V on C. We give a quick 
proof of this theorem. Taking a generic hyperplane section of V 
and inducing the rational map on it, one reduces the theorem to 
the case when V is itself a curve. Indeed, two distinct generically 
surjective rational maps f,f’: V—C induce distinct generically 
surjective maps on the hyperplane section, as one sees by taking 
the induced homomorphisms on the Albanese varieties, using AV, 
Theorem 4 of Chapter VIII, § 2. 

Assuming now that V is a curve, we have the formula for the 


genus 
2g(V) — 2 = d[2g(C) — 2) +4 


where 220. Thus the degree of V over C is bounded. Taking 
suitable projective embeddings, we see that the degree of the graph 
of our rational maps f must be bounded. Hence these graphs I's 
lie in finitely many algebraic families on V x C. On the other hand, 
a generic element of such families is likewise a generically sur- 
jective rational map of V onto C (as one sees by projecting on both 
factors). Taking the induced homomorphisms on the Jacobians, 
and using the fact that an abelian variety has no algebraic family 
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of abelian subvarieties, we see that all induced maps coming from 
the same family differ by translations. We now use the fact that 
C is not equal to a non-zero translation of itself in its Jacobian. 
We conclude that a graph J’, actually must constitute by itself a 
maximal algebraic family on V x C, and finally that there is only 
a finite number of such graphs, or maps f. This concludes the 
proof. (When V is a curve, we do not need characteristic 0, only 
the assumption that the map f/: V-—C is separable, to be able to 
use the genus formula above.) 

The Mordell conjecture thus gives rise to diophantine criteria for 
lowering fields of definition, and we actually proved such a criterion 
in the context of integral points (Proposition 2). 

Finally, let us mention Chabauty’s partial result. He proves the 
following statement in [6]: Let C be a complete non-singular curve 
of genus 22 defined over a p-adic field K. Let C be embedded in its 
Jacobian over K, and let I be a finitely generated subgroup of Jr. 
If the rank of I is smaller than (strictly) the genus of C, then Cn I 
is finite. We sketch his argument. An infinite number would have 
a point of accumulation, say P). In a neighborhood of Py, there 
will be infinitely many points of /’°9 C, and thus without loss of 
generality (by the non-archimedean behaviour) we may assume that 
I’ itself lies in such a neighoborhood. The assumption on the rank 
would then imply a linear relation on the differentials of the first 
kind on C or on J (those of C being induced by those of J), which 
is impossible. 


CHAPTER VIII 
Hilbert’s Irreducibility Theorem 


Let k be a field, and f(t, ---,¢,,.X1, +++, X;) a polynomial with 
coefficients in k which is irreducible as a polynomial in r +s vari- 
ables. We ask whether there exist values (t’) of (¢) in & such that 
the polynomial f(t’, X) is irreducible as a polynomial in (X) with 
coefficients in k. We shall exhibit a large variety of fields for 
which this is true. 

Suppose f(t, X) has coefficients in A(£), i.e. is in A(t)|X], and as- 
sume it is irreducible as a polynomial in (X) over A(t). Multiplying 
Ff by a polynomial ¢(¢), to make the coefficients lie in &[t] does not 
change this irreducibility property. If we then divide the result- 
ing polynomial by the greatest common divisor of its coefficients, 
we obtain a polynomial /,(t, X)¢A[t, X] which is irreducible over 
k. Conversely, if f(t, X) is in k[t, X] and irreducible over k, then 
it is irreducible in A(t)[X]. 

Given f(t, X)eR#[X], we denote by U;, (or simply U; if the 
reference to k is clear) the subset of the affine space S; consisting 
of those points (ti, --:, ¢}), with ¢;¢k, at which the coefficients of 
f are defined and such that f(t’, X) is irreducible in k[X] over k. 
Such U;,, will be called basic Hilbert sets. The intersection of a 
finite number of basic Hilbert sets with a finite number of non- 
empty Zariski open subsets of S; will be called a Hilbert subset of 
St. If r=1, then Si =& and the non-emply Zariski open sets are 
the complements of finite subsets of &. 

A field & is called Hilbertian if the Hilbert subsets of S; are not 
empty (and thus are infinite). If R is a subset of k, we shall say 
that R is Hilbertian in k if every Hilbert subset of S; contains 
some element (ti, ---,t}) with tj¢R. A ring will be said to be 
Hilbertian if it is Hilbertian in its quotient field. 

Among other results, we shall prove in this chapter that all rings 
of finite type over the prime ring, and of transcendence degree 
>1 in characteristic >0 are Hilbertian. In particular, every num- 
ber field is Hilbertian. The proof of this fact will depend only on 


elementary calculus. 
It will be shown also that the essential case occurs when 7, =1 
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and s=1, i.e. when there is one parameter and one variable. 
The general case can be reduced to this one by simple formal 
arguments. The essential case will be seen to be equivalent to the 
lifting of rational points in coverings of curves. From this point 
of view one is thus led to diophantine questions. 

In §6 we shall apply the irreducibility property to abelian 
varieties, and show for instance that if an abelian variety or a 
curve of genus =1 is defined over a field of finite type over Q, 
then there exist infinitely many non-degenerate specializations of 
it into a number field which induce an injection on the sets of 
rational points. 

We make one final observation, as Hilbert does [12], concerning 
applications to Galois theory. Suppose that we can construct a 
purely transcendental extension A(t,,---,¢-) of & and a finite Galois 
extension FE of &A(t,,-:-,#,) with group G. If k is Hilbertian then 
we can specialize the (¢) on (t’) in & in such a way that the de- 
composition group remains of the same order as that of G, and 
thus obtain a Galois extension of k with group G. This can be 
done in particular with the symmetric groups over any field k, 
thereby allowing either the possibility of constructing a Galois 
extension with the symmetric group if 2 is Hilbertian or of proving 
that a field is not Hilbertian (for instance, a p-adic field). However, 
it does not seem to be known whether the power series ring in 
several variables over a field (or a unique factorization domain 
with infinitely many primes) is Hilbertian. 


§1. Jrreducibility and integral points 
We consider the case of one ¢ and one X. Let us write 


Flt, X) = ant) X™ + +++ + ao(t) 


with a(t)ek[t]. In studying special values of ¢ in k, we always 
exclude the finite number which make a, vanish. 


Suppose that f(t, X) is irreducible over k(¢). If f has a factoriz- 
ation 


A(X) = alt) i CTD: 


in the algebraic closure of k(t), every value f, of ¢ in R determines 
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a homomorphism 
k[t] > Alto] = k 


which can be extended to the ring generated by the roots a1, +++, @» 
of f (under our assumption that a,(t.) #0). The image of these 
roots is determined up to a permutation. Let aj (i=1,---,m) be 


these images. If we have a factorization 
S(to, X) = G(X ho X) 


in k[X], the coefficients of g) and h) are polynomial functions of 
the a;. Let g and h be the polynomials corresponding to these 
functions, giving a factorization 


Ft, X) = AX)WX) 


in the algebraic closure of A(t). Since f is assumed to be irreduc- 
ible, one of the coefficients of g or h does not lie in k(t), say y. 
Then the ring A[t, y] is the affine ring of a curve C (which may 
be reducible) over k. The factorization f(t, X) = goo gives rise 
to a rational point (fo, yo) on our curve C which has projection fy 
on the first factor. 

If one writes f(t, X) = 9X Wea ) in all possible manners in the 
algebraic closure of A(t), with g and h of degree 21, there will be 
each time a coefficient which does not lie in k(f), whence we get 
a finite number of curves C,,---,Cn as above. Each value ¢ in k 
which cannot be lifted to a rational point of any of these curves 
will be such that f(t, X) is irreducible in RLX]. 

If & is the quotient field of a ring R, and if we replace y above 
by ¢g(t)y for a suitable polynomial g(¢)¢ R[t], we may suppose that 
y is integral over R[t]. Excluding the finite number of points ¢, 
for which ¢(f)) = 0, one sees that y) and ¢(f))y) are rational or irra- 
tional over k simultaneously. If, however, they are in k, then 
g(to)yo will be integral over R, and hence in FR if R is integrally 
closed (as is usual in practice). 

By an affine plane curve C over a field k, we shall mean an 
affine curve whose ring is of type &[t, y] over k, with ¢ transcend- 
ental over k. We do not assume that C is absolutely irreducible, 
so that A(t, y) need not be regular over k. If R is a subset of k, 
we denote by U,,2(C) or simply U:,2 the set of elements t,€ R such 
that there is no point PeC, for which «(P)=t, viewing ¢ as a 
function on C. From our preceding discussion, we have: 
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Proposition 1. Let k be a field, and R a subset of k. Every 
Hilbert subset of k contains a finite intersection of sets of type U:,Rr(C) 
for a finite number of affine plane curves C over k. 

The Hibert subsets of & mentioned here are those determined by 
irreducible polynomials f(t, X) with one ¢ and one X. We shall see 
in §3 that the restriction to one X can be dispensed with. 

Let us consider an affine plane curve over k, as above, with y 
not in &(t). If y is not separable over A(t), then a power y?” will 
be for m large and p = characteristic. If y?” actually lies in A(¢), 
then to find elements of U;,z one analyzes k*/k*? directly. Let us 
exclude this case. We are therefore reduced to studying the case 
where y is separable over A(f) but not in A(Z). 

Let k; be the algebraic closure of k in k(t, y). Then (¢, y) deter- 
mines an absolutely irreducible curve C, defined over k, having a 
conjugate Cy over k not equal to C, if R#,. Every rational point 
of Cin k must lie on both C, and C7, and hence there can be only 
a finite number of such points if C is not absolutely irreducible 
under the hypothesis that y is separable over A(t). In the sequel, 
in trying to find points in U;,2, we can therefore assume that C is 
absolutely irreducible. 

Suppose that we have a variety C of dimension 1 over & and a 
non-constant function ¢ in A(C). If for instance & is of finite type 
over Q, RF of finite type over Z, and C of genus =1 then we know 
that there is only a finite number of points P in C, such that 
t(P)e¢ R. The worst case would therefore occur when C has genus 
0, i.e. is a pure curve. 

In that case, following Siegel, one can apply the analysis of 
curves of genus 0 to estimate the number of bad points, but using 
an idea of Néron, it is just as well to reduce the general case to 
that of curves of genus 21, by means of the following trick. 

We start with our plane curve C, with generic point (t, y) over 
k. If we write ¢ = f(u) for a suitable polynomial f(z) in k[u], then 
we get a new curve C’ with generic puint (t, y,u) over k, which 
will have genus 21 (and in fact arbitrarily high for suitable /). 
We now look for special values wu which cannot be lifted to ration- 
al points (to, Yo, 4) of C’ in k. Of course, to = f(u) is determined, 
and if f is taken to have coefficients in a subring R of k, and um 
is selected in R, then ¢ will also lie in R, so that our integrality 
conditions are preserved . 

In order to find polynomials f we simply use the genus formula 
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for a covering. We have two cases: 

Case 1. The function t has at least 3 singularities (zeros or poles) 
on the complete non-singular curve determined by C. 

We then take m large, relatively prime to the characteristic of 
k, to the degree [k(C): k(t)], and to the orders of zeros and poles 
of ¢t. Then the curve C’ whose function field is k(C)(u) over k is 
absolutely irreducible, and of large genus, because the formula 

29’ —2 = —2m + Sie; —-1) 
shows that the term >\(e; — 1) will grow at least like 3(m— 1), each 
e; being equal to m. 

Case 2. The function t has exactly 2 singularities. 

Then we can find a function x in k(C) such that R(C) = A(x), and 
t = x” with an integer r > 0, prime to the characteristic p of k, or 
t = x" — a for some ae k, and an integer n=0. If R(C) + R(t) then 
r>1 or ~>0. Leaving aside the characteristic difficulties, and 
looking only at ¢ = x", we take for f(w) any polynomial of degree 
m prime to p with m distinct roots to see that the genus of C’ 
again becomes large if m becomes large. 

We have thus shown that any subring R of a number field is 
Hilbertian. In the next section, we shall given an elementary 
proof of this fact. 

In characteristic p > 0, we do not have the finiteness of integral 
points because of the Frobenius automorphism. However it suffices 
to use the Mordell-Weil theorem, which gives us a sufficiently 
strong way of counting points, to see that they are Hilbertian. In 
fact, having obtained our curve of genus 21, we embed it in its 
Jacobian over k. If Jf, is finitely generated, then we found an 
upper bound for the number of points of height <B in Chapter 
V, §4. In practice, if R is a subring of k, the number of points 
of R of height <B is very much larger than a power of log B, 
being in fact of the order of magnitude of B itself (for instance 
for the ordinary integers Z and, as the reader himself will verify 
immediately, for a polynomial ring in one variable over a finite 
constant field). Thus if we have a theory of heights on &, if 
f; C'>FP' is a generically surjective rational map of C’ on P' 
defined over k, and if C’ has genus 21, then we know that for a 
point P’¢C;, we have 

h(f(P')) = h(P')? 
where d is the degree of f (Property 4 of heights, which we are 
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applying in a case where it could be verified more trivially). 

Let us look at polynomials over a finite field. Let R=k,.[w] be a poly- 
nomial ring in one variable over the finite field k) with g elements. 
The number of elements of R of degree <yr is g’ and one also 
sees that we could replace thc degree by the logarithmic height, 
computed with respect to the obvious model. Let C be an affine 
plane curve with generic point (t, y) over the quotient field k of R. 
From the above discussion, we see that if y is separable over (2), 
then there exists a number a, with 0 < a<1 such that the number 
of elements of U;,2(C) of degree <r is 


q’ + O(@”) for r- om. 


If y is not separable, then one must examine this case more closely. 
We shall deal with it in connection with arbitrary ground fields 
kin §4. 


§2. Irreducibility over the rational numbers 


We consider the concrete case of the rational numbers, or rather 
the ring of integers Z. Let (¢, y) be a generic point of an affine 
plane curve C over Q, and suppose y€Q(t). After multiplying y 
by a suitable polynomial in Zt], we may assume that y is integral 
over Z[t]. Thus if any value ¢ (excluding a finite number) in Q 
extends to a point (f, yo) on C with y.€Q, then y lies in Z if ty 
lies in Z. 

We may view y as an algebraic function of ¢ over the reals, and 
as such it has an expansion at infinity: 

¥=¢) ar - +b +e +: 

with coefficients a, b,c,--- that are complex numbers, and ?’”” one 
of the e-th roots of t taken as parameter. We choose in fact ?!/ 
to be real. In that case, if there exist infinitely many values of 
t tending to infinity in R such that ¢g(t) is real, then the coefficients 
are in fact real. Indeed, if one of them were complex, let us con- 
sider the one most to the left, say & The angle of the term 
having & for its coefficient is #0, z. As ¢ tends to infinity, the 
term with this coefficient dominates the series to the right of it. 
Hence there cannot be any cancellations, and for to the values 
g(t) would be complex, contradicting the hypothesis. 
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We now study the integral points on the analytic curve (t, ¢(t)) 
and prove the following theorem, concerning their distribution. 


THEOREM 1, Let g(t) be a function of a real variable having a 
power series expansion 


g(t) = at" + +. +b+ cae + vee 


with coefficients a,b,c,:-: in R, in terms of the real parameter t'”, 
and converging for all sufficiently large values of t. Assume that 
g(t) does not lie in R{t], i.e. is not a polynomial in t. Suppose that 
that there is an infinity of positive integers 


lin << i faye OO 


such thai g(t;) is in Z. Then there exists an integer ij, an integer 
m >0, and a real number 24> 0, such that for all i > iy we have 
tiim —ti >t}. 


In other words, the ¢, become more widely spaced. Asa corollary, 
we get an estimate which we can handle better in practice. 


CoroLuary 1. Let g(t) be as in the theorem. There exists a real 
number a with 0 < a<1 such that the number NB) of ti < B for 
which g(t;) is in Z satisfies the inequality 

INCE) = Bb? 
for all B sufficiently large. 

Proof. Choose 0<8<1. Let N, be the number of integers ?; 
such that ¢;< B®+1, and N, the number of ¢; such that B® < ¢; S B. 
Write N.=sm +m, with s=0 and 0<m<m. By the Theorem 
one verifies immediately, using a crude estimate, that 


B+ sBP <B, 
whence s < B’** and thus 
NB)SM+M, 5 B¥t+1l+mt+B. 
From this one immediately deduces the existence of our number a. 
Coroutuary 2. Let U be a Hilbert subset of Q. Then there exists 


a number a, with 0 < a <1, such that the number of positive integers 
<B lying in U is at least equal to 
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B—B* 


for all B sufficiently large. 
Proof. Apply Proposition 1 of §1. 


Corotuary 3. A Hilbert subset of Q contains infinitely many prime 
numbers. 
Proof. Apply the prime number theorem. 


Corotuary 4. A Hilbert subset of Q is dense for the ordinary 
topology and every p-adic topology on Q. 

Proof. lf f(t, X) is irreducible over Q(t), then so is f(a + 1/t, X). 
Integral values of ¢ tending to infinity and making the specialized 
polynomial irreducible will be such that a@+1/f is close to a@ for 
the ordinary topology. For the p-adic topology, one takes 
fia + tp’, X) with » large. 

In order to prove Theorem 1, we need a mean value theorem 
due to H. A. Schwarz. 


Lemma. Let g(t) be a function m times continuously differentiable 
in the interval t; St Stitm. We suppose that ti < tits < +++ < bitm 
are real numbers (not necessarily in Z). Then there exists a number 
t with ti <t < titm such that 


Lt tf + tf? off) 
ove (tr) = il Len Lian ee Fei O(ti+m) 
m! Ve 


where Vm is the Vandermonde determinant. 
(We see that the numerator differs from V» only by the last 
column). 


Proof. I owe this proof to Walter Strodt. Let F(t) be the 
function 


1 ee t wi v(t) 


Fi) = Pees 
1 Uae cra oan O(ti+m—1) 
iL of EF g(t) 
Then F(¢) is 0 when ¢ is equal to the m points #;, +++, titm-1. For 
some constant C the function 


[VIII, §2] IRREDUCIRILITY OVER THE RATIONAL NUMBERS 149 


G(t) = Ft) — C¢ — ti)(t — tins) +++ & — tiem) 


will also vanish at ¢i:n, that is to say at m+1 points. Hence 
Gt) vanishes at least at one point r between ¢; and tim. But 
G'™ (t) = F™ (f) — m!C (all the other terms will cancel). From this 
we conclude that 


Bry =n . 


One sees immediately that F(t) will have a 0 everywhere in the 
bottom line of its matrix, except for the last term, which will be 
g(r). Hence 


F™ (¢) - oO” (r) Vesey 
where Vm-1 is the small Vandermonde determinant. Since 


‘CG = JUGS) 
Cs a ti) Base Uses 2 Lena) 


and since F(ti+m) is none other than the numerator of the expres- 
sion in the statement of the lemma, we see that our lemma is 
proved, taking into account the fact that the product of Vm-, and 
the denominator of C is precisely equal to Vm. 

Let us apply our lemma, taking for m an integer such that 9’ (tf) 
has no negative powers of 1/t'/’, hence its development is of the 


type 
me 1 
gy! (ft) aa ray tee 


with & real. Since ¢ is not a polynomial in ¢, its derivative gy” 
is not identically 0, and so we may assume € #0 and 4>0. This 
term dominates the series for ¢ large. Hence for sufficiently large 
i, we have ¢”(rt)#0. Observe that the determinant in the 
numerator of the expression in the statement of the lemma is an 
integer. We have just seen that g(r), and a fortiori yg ’(r)/m! 
is small, of order of magnitude ¢*. This implies that V,, is large. 
But V,, is a product of differences of the ¢;, and we can replace 
each such difference by ti+m—ti. This makes V, only larger. 
There will be m(m + 1)/2 such terms. We find therefore 


titm — t; > tY 


with a suitable 4’, taking an m(m + 1)/2-root. 


150 HILBERT’S IRREDUCIBILITY THEOREM [VIII, §3] 


§3. Reduction steps 


We study systematically the following reductions of the Hilbert 
polynomials to the case of one ¢, several X, several ¢ and several 
X, and finite separable extensions. 


a. One ft, several X. 


Let & be a field, and f(Xi, ---, Xs) a polynominal with coefficients 
in k, of degree <d in each X;. We denote the set of such poly- 
nomials by P(s,d,k). Kronecker’s specialization Sz transforms f 
into a polynomial in 1 variable, namely 


Si =e, Ye 


If we have a monomial X/! --- Xs, then Sz applied to this monomial 
yields 


yirtieds---+ig—yaet 


Taking into account the d-adic expansion of an integer =0, we see 
Sa is a bijection of P(s,d,k) on the polynomials in k[Y] of degree 
<d’—1. Furthermore, if f,g and fg are in P(s,d,k) then 


Salf9) = Sf )SAY) . 
Let fe P(s, d, k) and suppose that S.f is reducible: 
Saf(Y) =G(Y)H(Y). 


Then the degree of G and H is <d*—1 and hence G= Sug and 
H=S,h for some g,h uniquely determined by G, H respectively. 
From this we deduce Kronecker’s criterion: 

J ts irreducible in kif and only if for every factorization Sif =GH 
with G = Sag and H = S,h, the polynomial gh contains a monomial 


oxi} beer ci ee 
with pek, ¢ #0, of degree =d for one of the X;. 
Proposition 2. Let k be a field, f(t, X:,---, X;) a polynomial with 


coefficients in k, of degree <d in each X;, and let Sa be Kronecker’s 
substitution. Let 


Saf = JI 9, Y) 


be the factorization of Saf in k(t)[Y] into irreducible factors in Y. 
Then, except for a finite number of values of ty in k, every ty in k 
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such that each gj(t., Y) is irreducible in k[Y] is also such that 
F (to, X1, °**, Xn) =fo is irreducible in k[X],. 
Proof. "Except for a finite number of ¢, we have 


Safo = [I gstto, Y) 


a factorization of Sif) into irreducible factors in k[Y]. Excluding 
still another finite set of ¢, (those for which some coefficient ¢(f) 
in a monomial in Kronecker’s criterion will vanish), it is clear that 
fo will be irreducible. 

In particular, we see that every Hilbert subset of & determined 
by a polynomial /(t, X1, ---,X;) contains a Hilbert set determined 
by a finite number of polynomials g,(¢, Y) with a single Y. 


b. Several ¢, several X 


We consider a polynomial /f(t,, ---,¢,, Xi, «++, X;) aS a polynomial 
in ¢, and y +s-—1 variables ¢:, -+-,t,, X;, -+-, Xs. We specialize ¢, 
to preserve irreducibility, and continue inductively. Thus on the 
Hilbert set U;,, in S; we get a fibering, and we can determine its 
points by succesive consideration of the preceding case. 


c. Finite separable extensions 


We shall prove that if & is Hilbertian, then any finite separable 
extension of & is also Hilbertian. We prove in fact a stronger 
result: 


Proposition 3. Let E be a finite separable extension of a field k. 
Let f(t, X)¢ E(H[X] be irreducible over E(t). Assume that the lead- 
ind coefficient of f in X is 1, and that if o ranges over the distinct 
isomorphisms of E over k, then the conjugates f° of f are distinct 
Put 


Fu, X) = TI f"G, X). 


Then F(t, X) is in R([X] and is irreducible. Excluding a finite 
number of tok, Flto, X) is irreducible over k if and only if f(to, X) 
is irreducible over E. 

Proof. By Galois theory, F(f,X) is in R(f)[X]. If we had 
Fit, X) = G(X)H(X), a factorization over R(t), then we could as- 
sume G and H have leading coefficient 1. We conclude that /(¢, X) 
divides G or H over E(t)[X], say G. But then f° also divides G 
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for each o. As the f% are supposed to be distinct, their product 
divides G, a contradiction unless F =G. We now choose f in k 
such that F(t.,X) is irreducible over &. Then it is clear that 
fits, X) will be irreducible over E, taking into account unique 
factorization in E[X]. 

To apply the proposition even to the case where the conjugates 
f° might not be distinct, we make a simple transformation on /f. 
Indeed, there exists an element ¢ = g(t)¢ E(t) such that if we put 


ot, X) = f(t, X + ¢) 


all the g” are distinct. For instance, the constant term of g is 
f(t, ¢) and the existence of the desired ¢ is obvious. We can apply 
Proposition 3 to g, whose irreducibility over E(t) is equivalent to 
that of f(t, X). In particular, we get: 


Corotuary. Let E be a finite separable extension of a field k. 
Every Hilbert subset of E contains a Hilbert subset of k. 

For a treatment of the purely inseparable case the reader may 
consult [13], where the analogous result is proved. 


§4. Function fields 


In dealing with function fields, one is led by induction to the 
cases of purely transcendental extensions and finite extensions. 
Concerning the first, we show that Hilbert sets contain an ab- 
undance of elements. 


Proposition. Let k be an infinite field and w transcendental over 
k. Let t be transcendental over k(w), and y separable over k(w, t) 
but not in k(w,t). Let U be the Hilbert subset of k(w) determined 
by (t,) as in §1. Let m be an integer =1. In the (w,t) plane, 
there exists a non-empty Zariski open set V such that for any A,.€k 
(except possibly a finite number depending on U,m, V) all the values 
t, of t of type 

ty = T) + Aw — wWo)™ 
with to, WER and (to, Wo) € V are in U. 

Proof. After multiplying y by a polynomial in k[w,t] we may 
assume that y is integral over k[w,t]. We consider the surface with 
generic point (w,t, y) over k, which can be viewed as a covering of 
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the plane (w, ?). 
As we assume y separable over k(w,t), there exists a relation 


gw, t) = Plw,t, Y)fw,t, Y) + Ow, t, Y)f'(w, t, Y) 
where f(w,t, Y) is the irreducible polynomial of y over k[w,t], f’ 
its derivative with respect to Y, and P,Q are in k[w,t, Y] while 
gek[w,t], ¢ #0. Consequently, if (wo, to, yo) is a point of the sur- 
face in k such that (wo, t)) #0, then y has an expansion in formal 
power series: 
y = Deidt — t0)'(w — wo)? 

with c;;¢k. (Geometrically speaking, the covering is unramified.) 

The zeros of g(w,t) form a closed Zariski set, whose complement 
is the open set V.. From now on, we assume that (wo, t)) is in V 
and wp, t€ R. 

If there does not exist a point (wp , 7, ¥) on the surface in & 
above (wo, to), then for all 4.¢, there is no point on the (¢, y)-curve 
in k(w) above 

T + Ao(w — iG) ae 


Indeed, such a point would have polynomials in k[w] as coordinates, 
and the homomorphism of this ring mapping w on wy, would then 
give a point of the surface in k above (wp, Tt). 

Suppose, on the other hand, that (w,¢, 0) is a point of the 
surface in k. If we substitute 


Aw — Wo)” 


for (¢—t.) in the power series, we obtain a homomorphism of 
k{w,t, y] into the power series ring &[[w — wo]]. We have to choose 
2 in such a way that y does not map into k[w — wo] = k[w] (it will 
in any case be integral over &[w]). In other words, the series must 


not stop. 
Since y does not lie in &[w,¢], infinitely many c,; are not equal 


to 0. We can write 
Vim = Che wo) 
td 


= SeA(w — wo)” . 


Each c,(4) is a polynomial in 2. 
If, for some value 4) of 2 in k, all but a finite number of c,(Ao) 
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are 0, then ya,,m considered as a rational function of w has a pole 
at infinity (w= oo) of order equal to the largest » such that 
CA.) #0. But we have 


Sw, To =P Aw — Wo)"; room) — 0 A 


Since m is fixed, still excluding a finite number of 4), we see that 
the order of a pole of ya,.m at infinity is bounded (as a function 
of the coefficients of f, which is fixed throughout). If we choose 
vy, larger than this critical value and such that c,,(4.) #0 (which 
is possible since infinitely many c:; + 0) we get a ya,,m whose series 
cannot stop, because if it did, we would get a bad pole of ya,.m. 
This concludes the proof. 

In case y is not separable over k(w, ?t), then either y?” is not in 
k(w,t) for any 7, in which case it becomes separable for r large 
enough and we can apply our proposition to it; or y?” does lie in 
k(w,t) for some 7, so that we may write 


y?”” = o(w, t) 
with g(w, t)e k[w, t] and we may assume that g(w,t) is not a p-th 
power in &[w,t]. We distinguish two cases: the first, where there 
occurs some power of ¢ or w in ¢g which is not divisible by p, and 
the second, where p divides the exponent of ¢ and w in each term 


of g. In the second case, some coefficient of g in Rk is not a p-th 
power. 


In the first case, take m greater than the degree of ¢. Then 
setting t = Aww” for any 4€k, 4. #0, the polynomial 
g(w, Aw”) 


will have the same set of non-zero coefficients as ¢ (this is the 
Kronecker substitution). If 


ow, t) = Sy cisw't 
then 
g(w, Aw”) = Sreuhiw™ . 


If one of the e;; #0, and p does not divide i or j, then we simply 
pick m so that p does not divide i+ mj. 

The same argument could be applied after making an arbitrary 
translation (wo, t>) on (w, ?). 

In the second case, when p divides each i and j, then we can 
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choose any m greater than the degree of ¢ and again any 4 €k. 
If ci; is not a p-th power in k, then c;;43 will also not be a p-th 
power and hence g(w, 4.w™) will not be a p-th power in &[w]. 

This takes care of the remaining case in our treatment of poly- 
nomial rings over finite fields. Thus, up to a purely inseparable 
extension (left open in our third reduction step) we have proved 
the following theorem. 


THEOREM 2. Let R, be a ring, and R a finitely generated ring 
over Ry. Let k, and k be their quotient fields respectively. Then if 
Ry = Z, or if the transcendence degree of k over ky is =1, R is Hil- 
bertian. 

Actually if Ro is a finite field, k is separable over k, and hence 
our proof in this case is complete. 


§5. Abstract definition of Hilbert sets 


It will be convenient to reformulate the definition of Hilbert sets 
in the context of general rings. As usual, rings are without zero- 
divisors. 

Let R be a ring. As we know, spec (R) has the Zariski topology, 
a basis for the open sets being the subsets spec(Re) with eR, 
& +0, and spec(R:) is identified with the subset of prime ideals 
pb of spec(R) which do not contain €. These spec(Re) will be called 
basic Zariski sets. 

Let S be a finitely generated ring over R. We shall say that it 
is algebraic over R if all its elements are algebraic over the 
quotient field A(R) of R. Its quotient field A(S) is then a finite 
extension of K(R). Assume that S is such a ring. The basic Hil- 
bert set associated with S is the subset Us, or Us;rz, of spec(R), 
consisting of those p having the following two properties: 

(i) There is exactly one point $8 in spec(S) lying above pb. 

(ii) If we denote by A(p) and ($B) the residue classs fields of 
these points, then 


[A(P) : R(p)] = [K(S): ACR) . 


By a Hilbert subset of spec(R) we shall mean the intersection of 
a finite number of basic Zariski sets with a finite number of basic 
Hilbert sets. 

If S is of the above type, then for some £e€ R, & #0, S is integral 
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over Rt = R[1/E], and thus above each pespec(Rz) there exists at 
least one in spec(S). As we are interested only in intersections 
of subsets of spec(R) with basic Zariski sets, we may in practice 
assume that S is finite over R, i.e. integral and finitely generated 
over R. 

Let us recall some facts from commutative algebra. If R is 
integrally closed, and S integral over R, and if pespec(R) and 
$e spec(S) lies above p, then the separable degree 


[R(P) : R)]o 


is at most equal to [K(S): K(R)]. There are cases where this does 
not hold true for the degree. However, if R is a regular ring (i.e. 
one for which each local ring R, is regular) then we do have 


[(B) : k(p)] S [A(S): ACR)] . 


(This could be proved for instance by “blowing up” the given 
point p, i.e. taking a suitable discrete valuation ring » of K, whose 
maximal ideal m restricts to p, and whose residue class field o/m is 
purely transcendental over k(p). The point $ in spec(S) would lift 
to a point in the integral closure of o in AK(S), and we could apply 
the known theorem for valuation rings.) 

In practice, when we deal with rings R, they will be parameter 
rings, that is, we deal with schemes over R, and there will always 
exist an element Fe R, & #0, such that R: is regular (the simple 
points on a variety form a Zariski open set). 

In order to ensure the validity of the conclusion of the next 
proposition, we see from our preceding discussion that we must 
make some restrictions on our rings. However, with such as- 
sumptions, guaranteeing the degree inequality, the proof is obvious. 


Proposition 5. Let S’ > S2R be three rings, with S, S' finite 
over R, and R, S integrally closed. Let p¢€ Us:;z, and let B, ' be 
the points lying above it in spec(S) and spec(S') respectively. Assume 
that S, and R, are regular, or that k($') is a separable extension 
of k(p). Then B is in Us-s, and » is in Us;pr. 

In our applications, R = k[f;,---,¢,] is a finitely generated ring 
over a field k, with ¢,,---,¢, algebraically independent over k. If, 
as in the proposition, we are concerned with a finite extension S’ 
of a ring S, then using the Noether normalization theorem, one 
can find a polynomial ring R so that the search of points in Us-/s 
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is reduced to that of points in Usg’/z. 
Over the ring &[t,, ---, ¢,] we can relate the sets Uys of §1 to our 
sets Use of the previous discussion: 


Proposition 6. Let k be a field, k[t,, ---, t,| a purely transcendent- 
al ring extension R of k. Let S be finite over R. Then there exists 
an irreducible polynomial f(t,,++:,tr,X) such that Us;p contains 


Us,n 0 Z where Z is a basic Zariski set of spec(R). 

(We have identified the point in spec(R) determined by values 
(t’) of (¢) in R.) 

Proof. Without loss of generality, we may assume that & is in- 
PMc Say 8 = KIX), ---, %|. Put 


YHOuX, + SOD Se (Gah'es 


where the c; are elements of k such that if K = K(R) then 
K(%1, +++, %n) = K(y). Such elements can be found by the usual 
arguments used to prove the primitive element theorem of Galois 
theory. Then spec(S) and spec(R[y]) contain a common Zariski 
basic set, and from this, taking for f an irreducible equation of y 
over FR, our assertion is obvious. 

Our rings S over R may be viewed as defining algebraic systems 
of 0-cycles, whose generic members are relatively irreducible over 
the quotient field of R. The Hilbert sets then give points of the 
parameter scheme over which special members of the family re- 
main relatively irreducible. It is an easy matter (taking generic 
projections, as with Chow forms) to extend the definition to alge- 
braic systems of arbitrary dimension, i.e. to assume merely that 
we are dealing with a scheme S over spec(R) which is of finite 
type, without divisors of zero, and whose generic member is ir- 
reducible as a scheme. One would see that the set of p€spec(R) 
for which the localized fiber S, is without divisors of zero and is 
irreducible over k(p) contains a Hilbert subset of spec(R). 

One should not confuse this relative irreducibility with absolute 
irreducibility. For the convenience of the reader, let us recall 
here the standard result concerning the preservation of absolute 
irreducibility. 


Proposition 7. Let R be a ring, K its quotient field, and 
F(X, «++, Xs) #0 a polynomial in R(X] which is irreducible in K(X]. 
Then there exists EER, & #0 such that for any »p € spec(R:) the 
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reduced polynomial f,(X) ts irreducible in k(p)[X }. 

Proof. Vet f be of degree d. Consider two integers m,n2 1 
such that m+n=d. Let Mz (resp. Ms) range over all monomiais 
of degree <m (resp. m) and put 


Gu. = SY uaMlX) 
hy = > 0eMg(X) 


where the ws, va are a set of algebraically independent quantities 
over K. Then 


July = Si vate, v)MA(X) 


where the ¢g(u,v) are polynomials with integer coefficients and are 
“universal.” We can write 


KX) = SoaM(X) acR. 


By assumption, and Hilbert’s nullstellensatz, the ideal generated 
by the ¢ga(u,v) —ca in K[u,v] is the unit ideal, and 1 has a finite 
expression in terms of these generators. Multiplying the cofficients 
by some element of R, we get 


E = BS) dalgatu, v) — ea] 


with Fe R, & +0, and d,€ R[u, v]. 

For each m,n as above, we get such a relation, and we should 
actually index & by m,n, i.e. write En... Taking the product of 
these &,,,. for each pair m,n together with the non-zero cofficients 
of f, we get an element € of R which will obviously satisfy our 
requirements. 


§6. Applications to commutative group varieties 


Let R be an integrally closed ring, and A a commutative group 
variety scheme over spec(R), or over R as we shall say. We assume 
that it is simple over R (this amounts to the classical terminology 
of non-degeneracy). The only properties of such schemes which 
we Shall use are the following: 

If S, c S, are two rings containing R, and if A(S,), A(S,) denote 
the points of A in S,; and S,, respectively, then the natural map 


A(S;) > A(S:) 
is an injection. We identity A(S;) in A(S,). 
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For each péspec(R), the fiber Ay over p is a commutative group 
variety scheme over the residue class field A(p). If x¢ A(R) and 
x(p) denotes the corresponding point of A, in k(p), then the map 


=> XD) 


is a homomorphism of A(R) into A,(A()). 

If S> R, PEspec(S), and m an integer >1, prime to the char- 
acteristic of A(}), then the map x—x(§) defines an injection on 
the subgroup of points of A(S) consisting of the points of period m. 

If K is the quotient field of R, E a finite extension of K, and 
ac A(K), then there exists a subring S of E, finitely generated 
over R, such that ae A(S). 

Given a point c’ of period m in A,(k(p)), there exists a point c 
of period m in some A(S) for S finitely generated and algebraic 
over R such that c’ = c(%) for some Pespec(S) lying above p. 

If b€ A(E) for some extension EF of K, then there exists a smal- 
lest field K(6) of rationality for 6 containing K. If b€ A(S), where 
RcSc Kb) and $espec(S) lies above p, then RB) = R(p)(OCP)). 


Lemma 2. Let R be integrally closed and A a commutative group 
variety scheme over R. Let a€ A(R) and let m be an integer >0 
brime to the characteristic of K. Suppose that a€émA(K). Then 
there exists a ring S, finitely generated and algebraic over R, such 
that for all pé Us;r, the point a(p) does not lie in mA p(R(p)). 

Porof. Since a¢ mA(K), the field K(d), for any Sint be A(K) 
such that mb =a, has degree >1 over K. We may assume that 
all these points lie in A(S) where S is finitely generated and alge- 
braic over R. Replacing R by R[1/E] for a suitable & + 0in R, we 
may assume that S in finite over R. We also adjoint 1/m to R. 
We see then that for pé Us;e and $espec(S) lying above »p, the 
extension 


R(pyoP)) of Ri) 


has degree >1. By hypothesis, the map b—d() induces a bijection 
of the m-th roots of @ onto the m-th roots of a(p). Hence a(p) can- 
not lie in mA,(A(p)). 

If 7 is an abelian group, we denote by 7’,, the subgroup of points 
of period m. 


TueorEM 3. Let R be an integrally closed ring, and A a com- 
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mutative group variety scheme, simple over R. Let m be an integer 
>1, prime to the characteristic of the quotient field K of R. Let I 
be a finitely generated subgroup of A(R). Assume: 

(i) The period of each torsion element of I divides m. 

(ii) The canonical map T'|mI’' > A(K)/mA(K) is an injection 
(denoting by ['' the group T + A(K)m). Then there exists a Hilbert 
subset U of spec(R) such that for p¢ U, the map 


4 > Xp) 


‘induces an injection on TI. 

Proof. Extending R by a finite number of elements of type 1/& 
with —€ #0 in R, we may assume that 1/meéR and that all points 
in A(K) of period m lie in A(R). We may then assume, without 
loss of generality, that these points are contained in J” (consider- 
ing I’ instead of I’). 

We can write I’ as a direct sum of a free abelian group B and 
a finite torsion group C. Let ™,---,x, be a basis for B over Z. 
Let {a;} be a set of representatives in B of the factor group B/mB, 
whose coordinates in Z are the integers a such that OSasSm-—l1 
or 0S —a<xm-—1. We consider the finite set of points a; +c, 
ceéC. By assumption, we can determine for each such point a 
Hilbert set as in the lemma. Further, we can find a ring S finite- 
ly generated, algebraic over R, such that every point z¢ A(K) with 
mz=0 is in A(S). This condition implies in particular that if 
pe Us;r and K(z)# K then 2(p) is not rational over k(p). We let 
U be the intersection of the finite number of Hilbert sets just 
mentioned. 

Let pe U and let xe€I be such that x(p)=0. To show that x=0. 
we may assume that x¢ B and that x(p) has period m and derive 
a contradiction. By construction, x(p) cannot be equal to z2(p), where 
zisa point of period m in A(K) and not in A(K). Hence it must 
be of type c(p) with c€C. Consequently, 


x(p) — cp) = 0. 
For some a; and some ye B, we have 
xX=ajt+my 
and hence 
0 = a;(p) — c(p) + my) . 
By construction, a; —c =0, and hence y(p) has period m. 
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If we write x= Aim, + +++ + Ant, With 4;¢Z, then we can express 
y in terms of our basis x, ---,%, in the form 


Y= M1%1 am OOD + LnXn 


where my; = 4; — a:, the a; being the coordinates of a; with respect 
to our basis. If x #0, then repeating the procedure a finite number 
of times, we get a;(p) + c(p) = 0 for some a; and some c€C, which 
gives us our contradiction. 


Corotuary. Let Ay be an abelian variety defined over a field K, 
of finite type over Q. Then there exists an integrally closed ring R 
of finite type over Z having K as quotient field, and an abelian 
variety scheme A over R such that Ay is the extension of A to the 
ground field K, and A(R)= A(K). There exist infinitely many prime 
ideals » of R such that the residue class field k(p) is finite over Q, 
and that the map 


X > x10) 


of A(R) into A,(R()) is an injection on A(R). The above assertions 
remain true if Ay ts a complete non-singular curve of genus =1, 
and A 1s a scheme of such curves. - 

Proof. The first assertion is standard, and we can also make R 
integral over a ring /[t,,---,t,] where J is a ring generated over 
Z by 1/p:, ---,1/bm for a finite number of primes p;, and f,, ---, t, 
are algebraically independent over Q (using the Noether normaliza- 
tion theorem or Proposition 2 of Chapter I, §1). We can now use 
Propositions 5 and 6, the latter to apply Hilbert’s irreducibility 
theorem. The generalized Mordell-Weil theorem (Theorem 1 of 
Chapter V) gives us the finite generation of A(A), and consequent- 
ly we can also apply Theorem 3, in the case of abelian varieties. 

In the case of curves of genus 21, we select R large enough 
so that a canonical map of the curve into its Jacobian is defined 
over R, and that the family of such maps is non-degenerate. Then 
the points of the curve may be viewed as a subset of the points 
of its Jacobian, and our assertion is reduced to the case of abelian 
varieties. 

By means of this corollary, one can reduce certain qualitative 
assertions concerning points of curves over finitely generated rings 
and fields over Q to assertions about points on curves in number 
fields. For instance, the finiteness of integral points proved in 
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Chapter VII, for fields of finite type, is thus seen to be implied 
by the finiteness of integral points over finitely generated subrings 
of number fields over Z, provided one has Theorem 1 of Chapter V. 
In a like manner, the Mordell conjecture could be so reduced, if 
one could prove it in number fields. 


Historical Note: 


The proof of Hilbert’s theorem over the rationals is due to Dérge 
(9], from which all of §2 has been more or less copied. The 
reduction steps occur in Hilbert’s original paper [12]. 

Siegel [37[ had already observed the effect of his theorem on 
Hilbert’s. The idea of counting rational points on curves of genus 
=1 and finding that their number is much smaller than the num- 
ber of points on curves of genus 0 is due to Néron [24]. 

The proof of the function field case is due to Franz [10]. 

The theorem of §6 concerning the possibility of specializing a 
finitely generated subgroup of an abelian variety injectively is due 
to Néron [24]. By this method, he gives examples of elliptic cur- 
ves having non-trivial groups of rational points over certain num- 
ber fields. It is unknown whether one can construct elliptic 
curves over the rationals such that the group of rational points 
has arbitrarily high rank. There is a general feeling that this 
connot be done. One could try to investigate the geometric case 
of a rational pencil of elliptic curves on a surface, as Igusa does 
[14], and to raise the same question for surfaces. 

It would also be interesting to investigates to what extent the 
specialization theorem remains valid for non-commutative groups. 
For instance, given a finitely presented group of matrices in charac- 
teristic 0, does it always have an injective specialization into a 
group of matrices all of whose coefficients are algebraic numbers? 
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I include here the notation used most frequently, together with 
a short informal description of each term. 


ie ee 


K 
CP) 


Invertible elements in a ring R (resp. non-zero elements 
of a field K). 


: Algebraic closure of a field K. 
: Field obtained by adjoining to K a set of affine coordi- 


nates for a point P (equal to K(%o/%i, «++, %n/xi) if (%0, «++, Xn) 
are projective coordinates for P, x; # 0). 


: Number of elements in 0/p. 

: Extended by linearity from Nb. 

: Norm from £ to K. 

: Set of rational points of a variety over K. 

: Proper set of absolute values. 

: bv) if dD is an M;-divisor. 

Sane 

: The product Iy,ex, ||D| |» 

: Height determined by the map ¢ into projective space. 
: Relative height to a field K. 

: Height determined by a mapping derived from the linear 


system &(X). 


: Linear equivalence, or equivalence of functions (each is 


less than a constant multiple of the other). 


: Algebraic equivalence, or quasi-equivalence of functions 


(same as above, but with 1 + « and 1 — « in the exponent). 
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